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Preface 


Universal algebra has enjoyed a particularly explosive growth in the last twenty years, and 
a student entering the subject now will find a bewildering amount of material to digest. 

This text is not intended to be encyclopedic; rather, a few themes central to universal 
algebra have been developed sufficiently to bring the reader to the brink of current research. 
The choice of topics most certainly reflects the authors’ interests. 

Chapter I contains a brief but substantial introduction to lattices, and to the close con- 
nection between complete lattices and closure operators. In particular, everything necessary 
for the subsequent study of congruence lattices is included. 

Chapter II develops the most general and fundamental notions of universal algebra— 
these include the results that apply to all types of algebras, such as the homomorphism and 
isomorphism theorems. Free algebras are discussed in great detail—we use them to derive 
the existence of simple algebras, the rules of equational logic, and the important Mal’cev 
conditions. We introduce the notion of classifying a variety by properties of (the lattices of) 
congruences on members of the variety. Also, the center of an algebra is defined and used to 
characterize modules (up to polynomial equivalence). 

In Chapter III we show how neatly two famous results—the refutation of Euler’s con- 
jecture on orthogonal Latin squares and Kleene’s characterization of languages accepted by 
finite automata—can be presented using universal algebra. We predict that such “applied 
universal algebra” will become much more prominent. 

Chapter IV starts with a careful development of Boolean algebras, including Stone du- 
ality, which is subsequently used in our study of Boolean sheaf representations; however, 
the cumbersome formulation of general sheaf theory has been replaced by the considerably 
simpler definition of a Boolean product. First we look at Boolean powers, a beautiful tool 
for transferring results about Boolean algebras to other varieties as well as for providing a 
structure theory for certain varieties. The highlight of the chapter is the study of discrimi- 
nator varieties. These varieties have played a remarkable role in the study of spectra, model 
companions, decidability, and Boolean product representations. Probably no other class of 
varieties is so well-behaved yet so fascinating. 

The final chapter gives the reader a leisurely introduction to some basic concepts, tools, 
and results of model theory. In particular, we use the ultraproduct construction to derive the 
compactness theorem and to prove fundamental preservation theorems. Principal congruence 
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formulas are a favorite model-theoretic tool of universal algebraists, and we use them in the 
study of the sizes of subdirectly irreducible algebras. Next we prove three general results on 
the existence of a finite basis for an equational theory. The last topic is semantic embeddings, 
a popular technique for proving undecidability results. This technique is essentially algebraic 
in nature, requiring no familiarity whatsoever with the theory of algorithms. (The study 
of decidability has given surprisingly deep insight into the limitations of Boolean product 
representations. ) 

At the end of several sections the reader will find selected references to source material 
plus state of the art texts or papers relevant to that section, and at the end of the book one 
finds a brief survey of recent developments and several outstanding problems. 

The material in this book divides naturally into two parts. One part can be described 
as “what every mathematician (or at least every algebraist) should know about universal 
algebra.” It would form a short introductory course to universal algebra, and would consist 
of Chapter I; Chapter II except for §4, 812, 813, and the last parts of 811, 814; Chapter 
IV 81-4; and Chapter V 81 and the part of §2 leading to the compactness theorem. The 
remaining material is more specialized and more intimately connected with current research 
in universal algebra. 

Chapters are numbered in Roman numerals I through V, the sections in a chapter are 
given by Arabic numerals, $1, 82, etc. Thus I1§6.18 refers to item 18, which happens to 
be a theorem, in Section 6 of Chapter II. A citation within Chapter II would simply refer 
to this item as 6.18. For the exercises we use numbering such as II§85 Exercise 4, meaning 
the fourth exercise in 85 of Chapter II. The bibliography is divided into two parts, the first 
containing books and survey articles, and the second research papers. The books and survey 
articles are referred to by number, e.g., G. Birkhoff [3], and the research papers by year, e.g., 
R. McKenzie [1978]. 
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Preliminaries 


We have attempted to keep our notation and conventions in agreement with those of the 
closely related subject of model theory, especially as presented in Chang and Keisler’s Model 
Theory [8]. The reader needs only a modest exposure to classical algebra; for example he 
should know what groups and rings are. 

We will assume a familiarity with the most basic notions of set theory. Actually, we use 
classes as well as sets. A class of sets is frequently called a family of sets. The notations, 
A;, i € I, and (A;)ier refer to a family of sets indexed by a set I. A naive theory of sets 
and classes is sufficient for our purposes. We assume the reader is familiar with membership 
(€), set-builder notation ({—:—}), subset (C), union (U), intersection (A), difference (—), 
ordered n-tuples ((x1,...,%pn)), (direct) products of sets (Ax B,[[,-; Ai), and (direct) powers 
of sets (A‘). Also, it is most useful to know that 


(a) concerning relations: 
(i 
(ii 


(iii 


) an n-ary relation on a set A is a subset of A”; 

) if n = 2 it is called a binary relation on A; 

) the inverse r~ of a binary relation r on A is specified by (a,b) € r” iff (b,a) € r; 
) the relational product ros of two binary relations r, s on A is given by: (a,b) € ros 
iff for some c, (a,c) € 1, (c,b) € s; 


(iv 


(b) concerning functions: 


(i) a function f from a set A to a set B, written f : A — B, is a subset of Ax B 
such that for each a € A there is exactly one b € B with (a,b) € f; in this case 
we write f(a) =bor f: arn); 

(ii) the set of all functions from A to B is denoted by BA; 

(iii) the function f € B4 is injective (or one-to-one) if f(a.) = f(az) > a, = ag; 


(iv) the function f € BA is surjective (or onto) if for every b € B there is ana € A 
with f(a) =); 
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(v) the function f € BA is bijective if it is both injective and surjective; 
(vi) for f€ B4 and X CA, f(X)={be B: f(a) =b for some a € X}; 
(vii) for fe BA andY CB, f7(Y)={aeA: f(a) €Y}; 
) 


(viii) for f: A> Bandg: B—C, let go f : A —C be the function defined by 
(go f)(a) = g(f(a)). [This does not agree with the relational product defined 
above—but the ambiguity causes no problem in practice.]; 


(c) given a family F of sets, the union of F, U F, is defined by a € U F iff a € A for some 
A € F (define the intersection of F, (\F, dually); 


(d) a chain of sets C is a family of sets such that for each A,B € C either A C B or 
BCA; 


(e) Zorn’s lemma says that if Fis a nonempty family of sets such that for each chain C’ of 
members of F’ there is a member of F containing (JC (ie., C has an upper bound in 
F) then F’ has a maximal member M (i.e., M € F and MCA € F implies M = A); 


(f) concerning ordinals: 
(i) the ordinals are generated from the empty set @ using the operations of successor 
(ct = aU {x}) and union; 


(ii) O= 2, 1=0*, 2=17, etc.; the finite ordinals are 0,1,...; andn = {0,1,...,n— 
1}; the natural numbers are 1,2,3..., the nonzero finite ordinals; 


(iii) the first infinite ordinal is w = {0,1,2,...}; 


(iv) the ordinals are well-ordered by the relation €, also called <; 
(g) concerning cardinality: 


(i) two sets A and B have the same cardinality if there is a bijection from A to B; 


(ii) the cardinals are those ordinals « such that no earlier ordinal has the same car- 
dinality as k. The finite cardinals are 0,1,2,...; and w is the smallest infinite 
cardinal; 


(iii) the cardinality of a set A, written |A], is that (unique) cardinal « such that A and 
k have the same cardinality; 

(iv) |A|-|B] = |A x B| [= max(|A], |B]) if either is infinite and A,B 4 2]. ANB = 
@ => |A|+ |B] =|AU B| [= max(|A|,|B]) if either is infinite]; 


(h) one usually recognizes that a class is not a set by noting that it is too big to be put in 
one-to-one-correspondence with a cardinal (for example, the class of all groups). 


In Chapter IV the reader needs to know the basic definitions from point set topology, 
namely what a topological space, a closed (open) set, a subbasis (basis) for a topological space, 
a closed (open) neighborhood of a point, a Hausdorff space, a continuous function, etc., are. 

The symbol “=” is used to express the fact that both sides name the same object, whereas 
“x” is used to build equations which may or may not be true of particular elements. (A 
careful study of ¥ is given in Chapter II.) 


Chapter I 


Lattices 


In the study of the properties common to all algebraic structures (such as groups, rings, etc.) 
and even some of the properties that distinguish one class of algebras from another, lattices 
enter in an essential and natural way. In particular, congruence lattices play an important 
role. Furthermore, lattices, like groups or rings, are an important class of algebras in their 
own right, and in fact one of the most beautiful theorems in universal algebra, Baker’s finite 
basis theorem, was inspired by McKenzie’s finite basis theorem for lattices. In view of this 
dual role of lattices in relation to universal algebra, it is appropriate that we start with a 
brief study of them. In this chapter the reader is acquainted with those concepts and results 
from lattice theory which are important in later chapters. Our notation in this chapter is 
less formal than that used in subsequent chapters. We would like the reader to have a casual 
introduction to the subject of lattice theory. 

The origin of the lattice concept can be traced back to Boole’s analysis of thought and 
Dedekind’s study of divisibility. Schroeder and Pierce were also pioneers at the end of the 
last century. The subject started to gain momentum in the 1930’s and was greatly promoted 
by Birkhoff’s book Lattice Theory in the 1940’s. 
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There are two standard ways of defining lattices—one puts them on the same (algebraic) 
footing as groups or rings, and the other, based on the notion of order, offers geometric 
insight. 


Definition 1.1. A nonempty set L together with two binary operations V and A (read 
“join” and “meet” respectively) on L is called a lattice if it satisfies the following identities: 
Ll: (a)eVyeyVa 

(b)zaAysyAaz (commutative laws) 
L2: (a) eV (yVz) e (“@Vy)VzZz 
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(b)zA(yAz) &(aAy)Az (associative laws) 
L3: (a)aVaxe 

(be Se (idempotent laws) 
LAY (a) eee ley) 

(b) ax aA(rvy) (absorption laws). 


EXAMPLE. Let L be the set of propositions, let V denote the connective “or” and (A denote 
the connective “and”. Then L1 to L4 are well-known properties from propositional logic. 


EXAMPLE. Let L be the set of natural numbers, let V denote the least common multiple 
and A denote the greatest common divisor. Then properties L1 to L4 are easily verifiable. 


Before introducing the second definition of a lattice we need the notion of a partial order 
on a set. 


Definition 1.2. A binary relation < defined on a set A is a partial order on the set A if the 
following conditions hold identically in A: 


(i) a<a (reflexivity) 
(ii) a< band b<aimplya=b (antisymmetry) 
(iii) a<bandb<cimplya<c (transitivity). 


If, in addition, for every a,b in A 
(iv) a<borb<a 


then we say < is a total order on A. A nonempty set with a partial order on it is called a 
partially ordered set, or more briefly a poset, and if the relation is a total order then we speak 
of a totally ordered set, or a linearly ordered set, or simply a chain. In a poset A we use the 
expression a < b to mean a < 6 but aF b. 


EXAMPLES. (1) Let Su(A) denote the power set of A, i.e., the set of all subsets of A. Then 
C is a partial order on Su(A). 

(2) Let A be the set of natural numbers and let < be the relation “divides.” Then < is 
a partial order on A. 

(3) Let A be the set of real numbers and let < be the usual ordering. Then < is a total 
order on A. 

Most of the concepts developed for the real numbers which involve only the notion of 
order can be easily generalized to partially ordered sets. 


Definition 1.3. Let A be a subset of a poset P. An element p in P is an upper bound for 
A if a < p for every a in A. An element p in P is the least upper bound of A (l.u.b. of A), 
or supremum of A (sup A) if p is an upper bound of A, and a < 6 for every a in A implies 
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p <0 (i.e., p is the smallest among the upper bounds of A). Similarly we can define what it 
means for p to be a lower bound of A, and for p to be the greatest lower bound of A (g.1.b. 
of A), also called the infimum of A (inf A). For a,b in P we say b covers a, or a is covered 
by b, if a < b, and whenever a < c < b it follows that a = c or c = b. We use the notation 
a < 6 to denote a is covered by b. The closed interval a, b] is defined to be the set of c in P 
such that a <c < b, and the open interval (a,b) is the set of cin P such thata<c<b. 

Posets have the delightful characteristic that we can draw pictures of them. Let us 
describe in detail the method of associating a diagram, the so-called Hasse diagram, with 
a finite poset P. Let us represent each element of P by a small circle “o”. If a ~ b then 
we draw the circle for b above the circle for a, joining the two circles with a line segment. 
From this diagram we can recapture the relation < by noting that a < 6 holds iff for some 
finite sequence of elements c,,...,C, from P we have a = cy ~ €2-++Cn_1 ~ Cn = 6. We have 
drawn some examples in Figure 1. It is not so clear how one would draw an infinite poset. 
For example, the real line with the usual ordering has no covering relations, but it is quite 
common to visualize it as a vertical line. Unfortunately, the rational line would have the 
same picture. However, for those infinite posets for which the ordering is determined by the 
covering relation it is often possible to draw diagrams which do completely convey the order 
relation to the viewer; for example, consider the diagram in Figure 2 for the integers under 
the usual ordering. 


(e) (f) (g) (h) 


Figure 1 Examples of Hasse diagrams 
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Figure 2. Drawing the poset of the integers 


Now let us look at the second approach to lattices. 
Definition 1.4. A poset L is a lattice iff for every a,b in L both sup{a, b} and inf{a, b} exist 
(in L). 


The reader should verify that for each of the diagrams in Figure 1 the corresponding 
poset is a lattice, with the exception of (e). The poset corresponding to diagram (e) does 
have the interesting property that every pair of elements has an upper bound and a lower 
bound. 

We will now show that the two definitions of a lattice are equivalent in the following 
sense: if L is a lattice by one of the two definitions then we can construct in a simple and 
uniform fashion on the same set LD a lattice by the other definition, and the two constructions 
(converting from one definition to the other) are inverses. First we describe the constructions: 


(A) If L is a lattice by the first definition, then define < on L by a < biffa =a b; 
(B) If L is a lattice by the second definition, then define the operations V and A by aVb = 
sup{a, b}, and aA b = inf{a, b}. 


Suppose that L is a lattice by the first definition and < is defined as in (A). From aAa = a 
follows a <a. Ifa <bandb<athena=a/Ab and b= bAa; hence a = b. Also if a < b 
and b<cthena=aAbandb=bAc,soa=aAb=ad(bAc) = (aAb)Ac=adc; hence 
a <c. This shows < is a partial order on L. From a=a/A(aVb) andb=bA (aV b) follow 
a<aVbandb<avVb,soaV bis an upper bound of both a and b. Now if a < u and b < u 
then aVu=(aAu)Vu= u, and likewise b Vu = u, so (aVu) V(bV u) =uV u = u; hence 
(aV b) Vu = 4, giving (aV b) Au = (a Vb) A [(a V b) Vu] = a V b (by the absorption law), 
and this says aV b <u. Thus a V b= sup{a, b}. Similarly, a A b = inf{a, d}. 

If, on the other hand, we are given a lattice L by the second definition, then the 
reader should not find it too difficult to verify that the operations V and / as defined 
in (B) satisfy the requirements L1 to L4, for example the absorption law L4(a) becomes 
a = sup{a, inf{a, b}}, which is clearly true as inf{a, b} < a. 

The fact that these two constructions (A) and (B) are inverses is now an easy matter to 
check. Throughout the text we will be using the word lattice to mean lattice by the first 
definition (with the two operations join and meet), but it will often be convenient to freely 
make use of the corresponding partial order. 
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EXERCISES §1 


li, 


10. 


Verify that Su(X) with the partial order C is a lattice. What are the operations V and 
A? 


Verify L1—-L4 for V, A as defined in (B) below Definition 1.4. 
Show that the idempotent laws L3 of lattices follow from L1, L2, and L4. 


Let C/0,1] be the set of continuous functions from [0,1] to the reals. Define < on 
C0, 1] by f < g iff f(a) < g(a) for all a € [0,1]. Show that < is a partial order which 
makes C0, 1] into a lattice. 


If L is a lattice with operations V and A, show that interchanging V and A still gives a 
lattice, called the dual of L. (For constrast, note that interchanging + and - in a ring 
usually does not give another ring.) Note that dualization turns the Hasse diagram 
upside down. 


If G is a group, show that the set of subgroups $(G) of G with the partial ordering 
C forms a lattice. Describe all groups G whose lattices of subgroups look like (b) of 
Figure 1. 


If G is a group, let N(G) be the set of normal subgroups of G. Define V and A on 
N(G) by Ni A No = Ni ONo, and N,V No = N,No = {nyne a Ni, nN E No}. Show 
that under these operations N(G) is a lattice. 


If R is a ring, let [(R) be the set of ideals of R. Define V and A on I(R) by A Ig = 
Ab, LV Ig = {t1 +42: i, € Ih, tg € In}. Show that under these operations [(R) is a 
lattice. 


If < is a partial order on a set A, show that there is a total order <* on A such that 
a < b implies a <* b. (Hint: Use Zorn’s lemma.) 


If L is a lattice we say that an element a € L is join irreducible if a = bV c implies a = b 
or a =c. If L is a finite lattice show that every element is of the form a, V --- V dy, 
where each a; is join irreducible. 
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§2. Isomorphic Lattices, and Sublattices 


The word isomorphism is used to signify that two structures are the same except for the 
nature of their elements (for example, if the elements of a group are painted blue, one still 
has essentially the same group). The following definition is a special case of I1§2.1. 


Definition 2.1. Two lattices L; and Lz are isomorphic if there is a bijection a from L to 
Lz such that for every a,b in L, the following two equations hold: a(a V b) = a(a) V a(b) 
and a(a A b) = a(a) A a(b). Such an a is called an isomorphism. 

It is useful to note that if a is an isomorphism from L, to Ly then a7! is an isomorphism 
from Ly to L,, and if @ is an isomorphism from Ly, to L3 then Goa is an isomorphism from 
L, to L3. One can reformulate the definition of isomorphism in terms of the corresponding 
order relations. 


Definition 2.2. If P, and P, are two posets and a is a map from P,; to P:, then we say a 
is order-preserving if a(a) < a(b) holds in Py whenever a < b holds in P,. 


Theorem 2.3. Two lattices L, and Lz are isomorphic iff there is a bijection a from Ly to 
Ly such that both a and a“! are order-preserving. 


PROOF. If a is an isomorphism from L; to Ly and a < 6 holds in L, then a = aA b, so 
a(a) = a(aAb) = a(a) Aa(b), hence a(a) < a(b), and thus a is order-preserving. As a7? is 
an isomorphism, it is also order-preserving. 

Conversely, let @ be a bijection from L; to Ly such that both a and a are order- 
preserving. For a,b in L; we have a < aV band b < aVb, so a(a) < a(aV b) and 
a(b) < a(aVb), hence a(a) Va(b) < a(aV b). Furthermore, if a(a) Va(b) <u then a(a) < u 
and a(b) < u, hence a < a7'(u) andb<a™'(u), soaVb<a7'(u), and thus a(aV bd) < u. 
This implies that a(a)Va(b) = a(aVb). Similarly, it can be argued that a(a)Aa(b) = a(aAb). 


1 


iy 

Y 
———o 

iy 


b aera 

Cc a Cc 

d Scho 
L, L> 


Figure 3. An order-preserving bijection 
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It is easy to give examples of bijections a between lattices which are order-preserving but 
are not isomorphisms; for example, consider the map a(a) = a,...,a(d) = d where L, and 
Ly are the two lattices in Figure 3. 

A sublattice of a lattice DL is a subset of L which is a lattice in its own right, using the 
same operations. 


Definition 2.4. If L is a lattice and L’ 4 @ is a subset of L such that for every pair of 
elements a,b in L’ both a Vb anda/ bare in L’, where V and A are the lattice operations 
of L, then we say that L’ with the same operations (restricted to L’) is a sublattice of L. 


If L’ is a sublattice of L then for a,b in L’ we will of course have a < b in L’ iff a <b in 
L. It is interesting to note that given a lattice L one can often find subsets which as posets 
(using the same order relation) are lattices, but which do not qualify as sublattices as the 
operations V and /A do not agree with those of the original lattice L. The example in Figure 
4 illustrates this, for note that P = {a,c,d,e} as a poset is indeed a lattice, but P is not a 
sublattice of the lattice {a, b,c, d,e}. 


ao 
b 
c d 
e 
Figure 4 


Definition 2.5. A lattice L, can be embedded into a lattice Lz if there is a sublattice of L» 
isomorphic to [,; in this case we also say Ly contains a copy of Ly as a sublattice. 


EXERCISES §2 


1. If (X,7) is a topological space, show that the closed subsets, as well as the open 
subsets, form a lattice using C as the partial order. Show that the lattice of open 
subsets is isomorphic to the dual (see §1, Exercise 5) of the lattice of closed subsets. 


2. If P and Q are posets, let Q? be the poset of order-preserving maps from P to Q, 
where for f,g € Q? we define f < g iff f(a) < g(a) for alla € P. If Q is a lattice show 
that Q? is also a lattice. 


3. If Gis a group, is N(G) a sublattice of S(G) (see §1, Exercises 6,7)? 
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4. If < is a partial order on P then a lower segment of P is a subset S of P such that if 
s€S, pe€P,andp<-s then p€ S. Show that the lower segments of P form a lattice 
with the operations U, . If P has a least element, show that the set L(P) of nonempty 
lower segments of P forms a lattice. 


5. If Lis a lattice, then an ideal I of L is a nonempty lower segment closed under V. Show 
that the set of ideals /(L) of L forms a lattice under C. 


6. Given a lattice LZ, an ideal I of L is called a principal ideal if it is of the form {b € 
L:b< a}, for some a € L. (Note that such subsets are indeed ideals.) Show that the 
principal ideals of L form a sublattice of [(L) isomorphic to L. 


83. Distributive and Modular Lattices 


The most thoroughly studied classes of lattices are distributive lattices and modular lattices. 


Definition 3.1. A distributive lattice is a lattice which satisfies either (and hence, as we 
shall see, both) of the distributive laws, 


Diner (y V2) 
DIY (yA) 


~ 
~ 
~ 
~ 


Theorem 3.2. A lattice L satisfies D1 iff it satisfies D2. 


PROOF. Suppose D1 holds. Then 


EV WN) Se (eV (GAZ) VGA Z) (by L4(a)) 
wav ((f£Az)V(yAz)) (by L2(a)) 
eaoV((zAx)V(zZAy)) (by L1(b)) 
waV(zA(zvy)) (by D1) 
~av ((4 Vy) Az) (by L1(b)) 
ee (xA(zVy))V((@Vy)Az) (by L4(b)) 
x ((aVy)Aa)V((eVvy)Az) (by L1(b)) 
(GEN Oo) NEN) (by D1) 


Thus D2 also holds. A similar proof shows that if D2 holds then so does D1. 


Actually every lattice satisfies both of the inequalities (x Ay) V (x@Az) <aA(yVz) and 
rV(yAz) < (xVy)A(«Vz). To see this, note for example that rA\y <a andaAy <yVz; 
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hence zr Ay <xA(yV 2), etc. Thus to verify the distributive laws in a lattice it suffices to 
check either of the following inequalities: 


Definition 3.3. A modular lattice is any lattice which satisfies the modular law 
M:a<yroarV(yAz)=yA(rV 2). 
The modular law is obviously equivalent (for lattices) to the identity 
(cAy)V(yAz) SYA ((ZAY)V 2) 
since a < b holds iff a=a/A b. Also it is not difficult to see that every lattice satisfies 
r<yrouV(yAz)<yA(tVvz), 
so to verify the modular law it suffices to check the implication 


r<yrmyA(£Vz)<2V(yAz). 


Theorem 3.4. Every distributive lattice is a modular lattice. 


PROOF. Just use D2, noting that a V b = b whenever a < b. 


The next two theorems give a fascinating characterization of modular and distributive 
lattices in terms of two five-element lattices called Ms; and Ns; depicted in Figure 5. In 
neither case is a V (bA c) = (aV b) A (a Vc), so neither Ms; nor Ns is a distributive lattice. 
For Ns we also see that a < b but aV (bAc) #bA (aVc), so Ns is not modular. With a 
small amount of effort one can verify that Ms does satisfy the modular law, however. 


Figure 5 
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Theorem 3.5 (Dedekind). L is a nonmodular lattice iff Ns can be embedded into L. 


PROOF. From the remarks above it is clear that if N; can be embedded into L, then L does 
not satisfy the modular law. For the converse, suppose that DL does not satisfy the modular 
law. Then for some a, b,c in L we have a < b but aV (bAc) < bA(aVc). Let ay =aV (bAc) 
and b; = bA (aV c). Then 


cV bh =cA [bA (aVo)] 
= [cA(cVa)|Ab (by L1(a), L1(b), L2(b)) 
=cAb (by L4(b)) 


and 


cVa,=cV lav (bAc)] 
[eV (cAb)] Va (by L1(a), L1(b), L2(a)) 
= Ova (by L4(a)). 


Now ascAb< a, < bi we have cAb<cAa, <cAb; =cAb, hence cA ay =cAb; =cAb. 
Likewise cV b} =cV a, =cVa. 

Now it is straightforward to verify that the diagram in Figure 6 gives the desired copy of 
Ns in L. 


cA\b 
Figure 6 


Theorem 3.6 (Birkhoff). L is a nondistributive lattice iff Ms or Ns can be embedded into 
L. 


PROOF. If either Ms or Ns; can be embedded into L, then it is clear from earlier remarks 
that L cannot be distributive. For the converse, let us suppose that L is a nondistributive 
lattice and that L does not contain a copy of Ns as a sublattice. Thus L is modular by 3.5. 
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Since the distributive laws do not hold in L, there must be elements a, b,c from L such that 
(aN b)V (adc) <aA(bVc). Let us define 


d=(aAb)V (aAc)V(bAc) 
e=(avb)A(aVec)A(bVc) 
a, =(aAe)Vd 
bi = (bAe) Vd 
qj = (eNe).V a: 


Then it is easily seen that d < ay, b,,c; < e. Now from 
a\e=aA(bVc) (by L4(b)) 
and (applying the modular law to switch the underlined terms) 
a\d=a/N((aAb)V (aAc)V (bAc)) 


(aA b)V (aAc))V (ad (bAc)) (by M) 


it follows that d < e. 


ay by Poy 


Figure 7 


We now wish to show that the diagram in Figure 7 is a copy of M; in L. To do this it 
suffices to show that a1 Ab; =a, Acq, = bi: Acq, =danda Vb =a, Vq=bVq =e. 
We will verify one case only and the others require similar arguments (in the following we 
do not explicitly state several steps involving commutativity and associativity; the terms to 
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be interchanged when the modular law is applied have been underlined): 


a, \ by = ((aNe) Vd) A ((bAe) V a) 


= ((aANe) A ((bAe) Vad)) Vd (by M) 
=((aAe)A((bVd)Ae)) Vd (by M) 
=((aAe) AeA (bVd)) Vd 

= ((ade)A(bVd)) Vd (by L3(b)) 

=(aA (bVc)A(bV (aAc)))Vd (by L4) 

= (aA (bV ((bVc)A(aAc))))Vd (by M) 

=(aA(bV(aAc))) Vd (aNc<bVec) 
=(aAc)V(bAa)Vd (by M) 


EXERCISES 83 


il, 


If we are given a set X, a sublattice of Su(X) under C is called a ring of sets (following 
the terminology used by lattice theorists). Show that every ring of sets is a distributive 
lattice. 


If L is a distributive lattice, show that the set of ideals J(L) of L (see §2 Exercise 5) 
forms a distributive lattice. 


Let (X, 7) be a topological space. A subset of X is regular open if it is the interior of 
its closure. Show that the family of regular open subsets of X with the partial order 
C is a distributive lattice. 


If L is a finite lattice let J(L) be the poset of join irreducible elements of L (see 
81 Exercise 10), where a < b in J(L) means a < b in L. Show that if L is a finite 
distributive lattice then L is isomorphic to L(J(L)) (see §2 Exercise 4), the lattice 
of nonempty lower segments of J(L). Hence a finite lattice is distributive iff it is 
isomorphic to some L(P), for P a finite poset with least element. (This will be used 
in V§5 to show the theory of distributive lattices is undecidable.) 


If G is a group, show that N(G), the lattice of normal subgroups of G (see §1 Exercise 
7), is a modular lattice. Is the same true of S(G)? Describe N(Z_ x Z2). 


If R is aring, show that /(R), the lattice of ideals of R (see §1 Exercise 8), is a modular 
lattice. 


If M is a left module over a ring R, show that the submodules of M under the partial 
order C form a modular lattice. 
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84. Complete Lattices, Equivalence Relations, 
and Algebraic Lattices 


In the 1930’s Birkhoff introduced the class of complete lattices to study the combinations of 
subalgebras. 


Definition 4.1. A poset P is complete if for every subset A of P both sup A and inf A 
exist (in P). The elements sup A and inf A will be denoted by \/ A and / A, respectively. 
All complete posets are lattices, and a lattice Z which is complete as a poset is a complete 
lattice. 


Theorem 4.2. Let P be a poset such that [A exists for every subset A, or such that \V A 
exists for every subset A. Then P is a complete lattice. 


PROOF. Suppose /\ A exists for every A C P. Then letting A" be the set of upper bounds 
of A in P, it is routine to verify that /\ A“ is indeed \V A. The other half of the theorem is 
proved similarly. 


In the above theorem the existence of /\ @ guarantees a largest element in P, and likewise 
the existence of \/ @ guarantees a smallest element in P. So an equivalent formulation of 
Theorem 4.2 would be to say that P is complete if it has a largest element and the inf of 
every nonempty subset exists, or if it has a smallest element and the sup of every nonempty 
subset exists. 


EXAMPLES. (1) The set of extended reals with the usual ordering is a complete lattice. 
(2) The open subsets of a topological space with the ordering C form a complete lattice. 
(3) Su(/) with the usual ordering C is a complete lattice. 


A complete lattice may, of course, have sublattices which are incomplete (for example, 
consider the reals as a sublattice of the extended reals). It is also possible for a sublattice 
of a complete lattice to be complete, but the sups and infs of the sublattice not to agree 
with those of the original lattice (for example look at the sublattice of the extended reals 
consisting of those numbers whose absolute value is less than one together with the numbers 
—2, +2). 


Definition 4.3. A sublattice L’ of a complete lattice L is called a complete sublattice of L 
if for every subset A of L’ the elements \/ A and /\ A, as defined in JZ, are actually in L’. 


In the 1930’s Birkhoff introduced the lattice of equivalence relations on a set, which is 
especially important in the study of quotient structures. 


Definition 4.4. Let A be a set. Recall that a binary relation r on A is a subset of A?. If 
(a,b) € r we also write arb. If r; and r2 are binary relations on A then the relational product 
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Tr, © Tg is the binary relation on A defined by (a,b) € r1 0 rg iff there is a c € A such that 
(a,c) € ry, and (c,b) € rg. Inductively one defines 71 org 0+ ++ OT, = (71 OT2O+++ OTp_1) OT. 
The inverse of a binary relation r is given by r° = {(a,b) € A? : (b,a) € r}. The diagonal 
relation A, on A is the set {(a,a) : a € A} and the all relation A? is denoted by V4. (We 
write simply A (read: delta) and V (read: nabla) when there is no confusion.) A binary 
relation r on A is an equivalence relation on A if, for any a,b,c from A, it satisfies: 


El: ara (reflexivity) 
E2: arb implies bra (symmetry) 
E3: arb and brc imply arc (transitivity). 


Eq(A) is the set of all equivalence relations on A. 


Theorem 4.5. The poset Eq(A), with C as the partial ordering, is a complete lattice. 


PRrooF. Note that Eq(A) is closed under arbitrary intersections. 


For 0; and 02 in Eq(A) it is clear that 0; A 02 = 0,162. Next we look at a (constructive) 
description of 0; V 45. 


Theorem 4.6. Jf 6, and 0 are two equivalence relations on A then 
6, V Oo = 0, U (0, 0 Oz) U (8, © A 0 O1) U (8; 0 6g 0 1 0 2) Us: 5 


or equivalently, (a,b) € 0, V 05 iff there is a sequence of elements c1,C2,..-,Cn from A such 
that 
(Ci, Citi) € 0, or (Cy, Ci41) € O2 


jor t= 1.30 = 1,ond. a= e,0: = Ce 
PROOF. It is not difficult to see that the right-hand side of the above equation is indeed an 


equivalence relation, and also that each of the relational products in parentheses is contained 
in A, V Ao. 


If {6;}ier is a subset of Eq(A) then it is also easy to see that /\,., 4; is just (),<; 4;. The 
following straightforward generalization of the previous theorem describes arbitrary sups in 
Eq(A). 


Theorem 4.7. [f 0; € Eq(A) fori € I, then 


VV 9: = JU8in 0 i, 0+ 085, + 0,4 ET, & < oo}. 


EL 


Definition 4.8. Let 6 be a member of Eq(A). For a € A, the equivalence class (or coset) of 
a modulo @ is the set a/6 = {b € A: (b,a) € 0}. The set {a/6: a € A} is denoted by A/6. 
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Theorem 4.9. For ? € Eq(A) and a,b € A we have 


(a) A= Use, a/0. 
(b) a/0 4 b/0 implies a/0N 6/0 = ©. 


PROOF. (Exercise). 


An alternative approach to equivalence relations is given by partitions, in view of 4.9. 


Definition 4.10. A partition 7 of a set A is a family of nonempty pairwise disjoint subsets 
of A such that A = (Jz. The sets in 7 are called the blocks of 7. The set of all partitions of 
A is denoted by II(A). 


For 7 in II(A), let us define an equivalence relation 6(7) by @(7) = {(a,b) € A® : {a,b} C 
B for some B in 7}. Note that the mapping 7 +> 6(7) is a bijection between II(A) and 
Eq(A). Define a relation < on II(A) by m1 < 7m iff each block of 7 is contained in some 
block of 7. 


Theorem 4.11. With the above ordering (A) is a complete lattice, and it is isomorphic 
to the lattice Eq(A) under the mapping 7 +> O(7). 


The verification of this result is left to the reader. 
Definition 4.12. The lattice H(A) is called the lattice of partitions of A. 
The last class of lattices which we introduce is that of algebraic lattices. 


Definition 4.13. Let L be a lattice. An element a in L is compact iff whenever \/ A exists 
anda < \/A for AC L, then a < \/ B for some finite B C A. L is compactly generated iff 
every element in L is a sup of compact elements. A lattice L is algebraic if it is complete 
and compactly generated. 


The reader will readily see the similarity between the definition of a compact element in 
a lattice and that of a compact subset of a topological space. Algebraic lattices originated 
with Komatu and Nachbin in the 1940’s and Biuchi in the early 1950’s; the original definition 
was somewhat different, however. 


EXAMPLES. (1) The lattice of subsets of a set is an algebraic lattice (where the compact 
elements are finite sets). 

(2) The lattice of subgroups of a group is an algebraic lattice (in which “compact” = 
“finitely generated” ). 

(3) Finite lattices are algebraic lattices. 

(4) The subset [0,1] of the real line is a complete lattice, but is not algebraic. 
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In the next chapter we will encounter two situations where algebraic lattices arise, namely 
as lattices of subuniverses of algebras and as lattices of congruences on algebras. 


EXERCISES §4 


iL 


2. 


§5. 


Show that the binary relations on a set A form a lattice under C . 


Show that the right-hand side of the equation in Theorem 4.6 is indeed an equivalence 
relation on A. 


If I is a closed and bounded interval of the real line with the usual ordering, and P a 
nonempty subset of J with the same ordering, show that P is a complete sublattice iff 
P is aclosed subset of J. 


If L is a complete chain show that L is algebraic iff for every a,,a2 € L with a, < ag 
there are 61, b2 € L with ay < by W be < ag. 


Draw the Hasse diagram of the lattice of partitions of a set with n elements for 1 < 
n <4. For |A| > 4 show that II(A) is not a modular lattice. 


If L is an algebraic lattice and D is a subset of L such that for d,,d2 € D there is a 
d3 € D with d, < d3,dz < d3 (i.e., D is upward directed) then, fora € L, aA \/ D= 


Vacp(a/ a). 


. If Lis a distributive algebraic lattice then, for any A C L, we have aA\V/ A = Vge4lan 


d). 


. Ifa and b are compact elements of a lattice L, show that aV 6 is also compact. Is aA b 


always compact? 


. If Lis a lattice with at least one compact element, let C'(L) be the poset of compact 


elements of L with the partial order on C'(L) agreeing with the partial order on L. An 
ideal of C(L) is a nonempty subset J of C(L) such that a,b € I implies aV b € J, and 
a€ I, b€ C(L) with b < a implies b € I. Show that the ideals of C(L) form a lattice 
under C if ZL has a least element and that the lattice of ideals of C(L) is isomorphic 
to L if L is an algebraic lattice. 


Closure Operators 


One way of producing, and recognizing, complete [algebraic] lattices is through [algebraic] 
closure operators. Tarski developed one of the most fascinating applications of closure op- 
erators during the 1930’s in his study of “consequences” in logic. 
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Definition 5.1. If we are given a set A, a mapping C': Su(A) — Su(A) is called a closure 
operator on A if, for X,Y C A, it satisfies: 


Cl: X CC(X) (extensive) 
COG X= CCX) (idempotent) 
C3: X CY implies C(X) C C(Y) (isotone). 


A subset X of A is called a closed subset if C(X) = X. The poset of closed subsets of A with 
set inclusion as the partial ordering is denoted by Lc. 


The definition of a closure operator is more general than that of a topological closure 
operator since we do not require that the union of two closed subsets be closed. 


Theorem 5.2. Let C be a closure operator on a set A. Then Lo is a complete lattice with 


A C(Ad) = (C(Ad) 


ie i€l 
and 


Vewuy=¢ (Yai). 


we. tel 


ProoF. Let (A;)je7 be an indexed family of closed subsets of A. From 


for each 2, we have 
so 


hence 
(na) =) As 


so (),<; Ai is in Lc. Then, if one notes that A itself is in Lg, it follows that Lc is a complete 
lattice. The verification of the formulas for the A\’s and \/’s of families of closed sets is 
straightforward. 


Interestingly enough, the converse of this theorem is also true, which shows that the 
lattices Lc arising from closure operators provide typical examples of complete lattices. 
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Theorem 5.3. Every complete lattice is isomorphic to the lattice of closed subsets of some 
set A with a closure operator C. 
PROOF. Let L be a complete lattice. For X C L define 

C(X) = {ae L:a<supX}. 


Then C is a closure operator on L and the mapping a+ {b € L: b < a} gives the desired 
isomorphism between L and Lo. 


The closure operators which give rise to algebraic lattices of closed subsets are called al- 
gebraic closure operators; actually the consequence operator of Tarski is an algebraic closure 
operator. 


Definition 5.4. A closure operator C on the set A is an algebraic closure operator if for 
every X CA 

C4: C(X) =U{C(Y) : Y CX and Y is finite}. 

(Note that C1, C2, C4 implies C3.) 

Theorem 5.5. If C is an algebraic closure operator on a set A then Lo is an algebraic 


lattice, and the compact elements of Lo are precisely the closed sets C(X), where X is a 
finite subset of A. 


PRrooF. First we will show that C(X) is compact if X is finite. Then by (C4), and in view 
of 5.2, Lo is indeed an algebraic lattice. So suppose X = {aj,...,a,} and 


C(X) C\/C(Ai) =C (U 4 


iel iel 
For each a; € X we have by (C4) a finite X; C U 
finitely many A,’s, say Ajy,..., Ajn,, such that 


icp Ai With a; € C(X;). Since there are 
Jn59 


Ay G Ag Un Agi 


then 
a; E C(Ajt U---U Ads) 
But then 
XC LU CAnu---UAjn,), 
1<j<k 
sO 


Xoc!] VU Axl, 
1<j<k 
1<i<n; 
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and hence 


c(xycce| U 4x) = Yo CAp), 
1<j<k 1<j<k 
1<i<n,; 1<i<n,; 


so C'(X) is compact. 
Now suppose C'(Y) is not equal to C(X) for any finite X. From 


Civ) Sc Lic) :X CY and X is finite} 


it is easy to see that C(Y) cannot be contained in any finite union of the C(X)’s; hence 
C(Y) is not compact. 


Definition 5.6. If C is a closure operator on A and Y is a closed subset of A, then we say 
a set X is a generating set for Y if C(X) = Y. The set Y is finitely generated if there is a 
finite generating set for Y. The set X is a minimal generating set for Y if X generates Y 
and no proper subset of X generates Y. 


Corollary 5.7. Let C be an algebraic closure operator on A. Then the finitely generated 
subsets of A are precisely the compact elements of Lo. 


Theorem 5.8. Every algebraic lattice is isomorphic to the lattice of closed subsets of some 
set A with an algebraic closure operator C. 


Proor. Let L be an algebraic lattice, and let A be the subset of compact elements. For 
X CA define 
C(X) ={ae A:a<\/ X}. 


C' is a closure operator, and from the definition of compact elements it follows that C’ is 
algebraic. The map a+> {b € A: b < a} gives the desired isomorphism as L is compactly 
generated. 
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EXERCISES 85 


1. If Gis a group and X CG, let C(X) be the subgroup of G generated by X. Show that 
C' is an algebraic closure operator on G. 


2. If G is a group and X CG, let C(X) be the normal subgroup generated by X. Show 
that C' is an algebraic closure operator on G. 
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3. If Ris aring and X C R, let C(X) be the ideal generated by X. Show that C' is an 


algebraic closure operator on R. 


. If L is a lattice and A C L, let u(A) = {b € L: a < b fora € A}, the set of upper 


bounds of A, and let [(A) = {0b € L:b <a fora € A}, the set of lower bounds of A. 
Show that CA) = I(u(a)) is a closure operator on A, and that the map a: at C({a}) 
gives an embedding of L into the complete lattice Le (called the Dedekind-MacNeille 
completion). What is the Dedekind-MacNeille completion of the rational numbers? 


. If we are given a set A, a family K of subsets of A is called a closed set system for A 


if there is a closure operator on A such that the closed subsets of A are precisely the 
members of K. If K C Su(A), show that K is a closed set system for A iff K is closed 
under arbitrary intersections. 


Given a set A and a family K of subsets of A, K is said to be closed under unions of 


chains if whenever CC K and C is a chain (under C) then (JC € K; and K is said to 
be closed under unions of upward directed families of sets if whenever D C K is such that 
A;, Ap € D implies A; U Ag C A3 for some A3 € D, then (JD € K. A result of set theory 
says that K is closed under unions of chains iff kK is closed under unions of upward directed 
families of sets. 


. (Schmidt). A closed set system K for a set A is called an algebraic closed set system 


for A if there is an algebraic closure operator on A such that the closed subsets of A 
are precisely the members of Kk. If k C Su(A), show that K is an algebraic closed set 
system iff K is closed under (i) arbitrary intersections and (ii) unions of chains. 


. If C is an algebraic closure operator on S and X is a finitely generated closed subset, 


then for any Y which generates X show there is a finite Yo C Y such that Yo generates 
X. 


. Let C be a closure operator on S. A closed subset X 4 S' is maximal if for any closed 


subset Y with X C Y CS, either X = Y or Y = S. Show that if C’ is algebraic and 
X CS with C(X) # S then X is contained in a maximal closed subset if S is finitely 
generated. (In logic one applies this to show every consistent theory is contained in a 
complete theory. ) 


Chapter II 


The Elements of Universal Algebra 


One of the aims of universal algebra is to extract, whenever possible, the common elements 
of several seemingly different types of algebraic structures. In achieving this one discovers 
general concepts, constructions, and results which not only generalize and unify the known 
special situations, thus leading to an economy of presentation, but, being at a higher level of 
abstraction, can also be applied to entirely new situations, yielding significant information 
and giving rise to new directions. 

In this chapter we describe some of these concepts and their interrelationships. Of pri- 
mary importance is the concept of an algebra; centered around this we discuss the notions 
of isomorphism, subalgebra, congruence, quotient algebra, homomorphism, direct product, 
subdirect product, term, identity, and free algebra. 


81. Definition and Examples of Algebras 


The definition of an algebra given below encompasses most of the well known algebraic struc- 
tures, as we shall point out, as well as numerous lesser known algebras which are of current 
research interest. Although the need for such a definition was noted by several mathemati- 
cians such as Whitehead in 1898, and later by Noether, the credit for realizing this goal goes 
to Birkhoff in 1933. Perhaps it should be noted here that recent research in logic, recur- 
sive function theory, theory of automata, and computer science has revealed that Birkhoff’s 
original notion could be fruitfully extended, for example to partial algebras and heteroge- 
neous algebras, topics which lie outside the scope of this text. (Birkhoff’s definition allowed 
infinitary operations; however, his main results were concerned with finitary operations. ) 


Definition 1.1. For A a nonempty set and n a nonnegative integer we define A® = {@}, 
and, for n > 0, A” is the set of n-tuples of elements from A. An n-ary operation (or function) 
on A is any function f from A” to A; n is the arity (or rank) of f. A finitary operation is an 
n-ary operation, for some n. The image of (a1,...,@,) under an n-ary operation f is denoted 
by f(a1,.--,@n). An operation f on A is called a nullary operation (or constant) if its arity 
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is zero; it is completely determined by the image f(@) in A of the only element @ in A®, 
and as such it is convenient to identify it with the element f(@). Thus a nullary operation is 
thought of as an element of A. An operation f on A is unary, binary, or ternary if its arity 
is 1,2, or 3, respectively. 


Definition 1.2. A language (or type) of algebras is a set F of function symbols such that 
a nonnegative integer n is assigned to each member f of F. This integer is called the arity 
(or rank) of f, and f is said to be an n-ary function symbol. The subset of n-ary function 
symbols in F is denoted by Fy. 


Definition 1.3. If F is a language of algebras then an algebra A of type F is an ordered 
pair (A, F’) where A is a nonempty set and F' is a family of finitary operations on A indexed 
by the language F such that corresponding to each n-ary function symbol f in F there is an 
n-ary operation f* on A. The set A is called the universe (or underlying set) of A = (A, F), 
and the f“’s are called the fundamental operations of A. (In practice we prefer to write just 
f for f4—this convention creates an ambiguity which seldom causes a problem. However, 
in this chapter we will be unusually careful.) If F is finite, say ¥ = {fi,..., fx}, we often 
write (A, f1,..., f,) for (A, F), usually adopting the convention: 


arity f, > arity fo >---> arity fr. 


An algebra A is unary if all of its operations are unary, and it is mono-unary if it has just 
one unary operation. A is a groupoid if it has just one binary operation; this operation is 
usually denoted by + or -, and we write a+b or a-b (or just ab) for the image of (a, b) under 
this operation, and call it the sum or product of a and 6, respectively. An algebra A is finite 
if |A| is finite, and trivial if |A| = 1. 

It is a curious fact that the algebras that have been most extensively studied in conven- 
tional (albeit modern!) algebra do not have fundamental operations of arity greater than 
two. (However see IV87 Ex. 8.) 

Not all of the following examples of algebras are well-known, but they are of considerable 
importance in current research. In particular we would like to point out the role of recent 
directions in logic aimed at providing algebraic models for certain logical systems. The reader 
will notice that all of the different kinds of algebras listed below are distinguished from each 
other by their fundamental operations and the fact that they satisfy certain identities. One 
of the early achievements of Birkhoff was to clarify the role of identities (see §11). 


EXAMPLES. (1) Groups. A group G is an algebra (G,-,~',1) with a binary, a unary, and 
nullary operation in which the following identities are true: 


Gl: a-(y-z)&(a-y)-z 
G2: 2-lel-rxee 
Gat ga ae ae ee 


A group G is Abelian (or commutative ) if the following identity is true: 
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G4: c-yXy-e. 


Groups were one of the earliest concepts studied in algebra (groups of substitutions 
appeared about two hundred years ago). The definition given above is not the one which 
appears in standard texts on groups, for they use only one binary operation and axioms 
involving existential quantifiers. The reason for the above choice, and for the descriptions 
given below, will become clear in §2. 

Groups are generalized to semigroups and monoids in one direction, and to quasigroups 
and loops in another direction. 


(2) SEMIGROUPS AND Monoips. A semigroup is a groupoid (G,-) in which (G1) is true. 
It is commutative (or Abelian) if (G4) holds. A monoid is an algebra (M,-,1) with a binary 
and a nullary operation satisfying (G1) and (G2). 


(3) QUASIGROUPS AND Loops. A quasigroup is an algebra (Q,/,-,\) with three binary 
operations satisfying the following identities: 


Ql: a\(a-y) Sy; (w@-y)/yra 

Q2: x-(x\y) sy; (a/y)-y 2. 

A loop is a quasigroup with identity, i.e., an algebra (Q, /,-, \,1) which satisfies (Q1), (Q2) 
and (G2). Quasigroups and loops will play a major role in Chapter III. 


(4) Rinas. A ring is an algebra (R,+,-,—,0), where + and - are binary, — is unary and 0 
is nullary, satisfying the following conditions: 


R1: (R,+,—,0) is an Abelian group 

R2: (R,-) is a semigroup 

R3: x- (y+ 2) = (a@-y) + (a: z) 
(w+ y)- 2 (a-z)+(y-2). 


A ring with identity is an algebra (R,+,-,—,0,1) such that (R1)—-(R3) and (G2) hold. 


(5) MODULES OVER A (FIXED) RING. Let R be a given ring. A (left) R-module is an 
algebra (M,+,—,0, (f;)rer) where + is binary, — is unary, 0 is nullary, and each f, is unary, 
such that the following hold: 


M1: (M,+,-—,0) is an Abelian group 
M2: f(x +y) © f(z) + f-(y), forre R 
M3: fr+s(@) © fr(x) + fe(a), for r,s € R 
MA: f,(fs(x)) © fre(z) for r,s € R. 


Let R be a ring with identity. A unitary R-module is an algebra as above satisfying (M1)— 
(M4) and 


MB: fi(a) & a. 
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(6) ALGEBRAS OvER A RING. Let R be a ring with identity. An algebra over R is an 
algebra (A,+,-,—,0, (f;)rer) such that the following hold: 


Al: (A,+,—-,0, (f-)rer) is a unitary R-module 
A2: (A,+,-,—,0) is a ring 
A3: flay) © (f(x) -y =v: frly) for r € R. 


(7) SEMILATTICES. A semilattice is a semigroup (S,-) which satisfies the commutative law 
(G4) and the idempotent law 


Sli v-xuea. 


Two definitions of a lattice were given in the last chapter. We reformulate the first 
definition given there in order that it be a special case of algebras as defined in this chapter. 


(8) LATTICES. A lattice is an algebra (L,V,/A) with two binary operations which satisfies 
(L1)-(L4) of I§1. 


(9) BOUNDED LATTICES. An algebra (L,V,/,0,1) with two binary and two nullary opera- 
tions is a bounded lattice if it satisfies: 


BLI1: (L,V,A) is a lattice 
BL2: cA0#0; eV1e1. 


(10) BOOLEAN ALGEBRAS. A Boolean algebra is an algebra (B,V,/,',0, 1) with two binary, 
one unary, and two nullary operations which satisfies: 


Bl: (B,V,A) is a distributive lattice 
B2: eA O-e V1 wel 
Bo PAs wee Va 


Boolean algebras were of course discovered as a result of Boole’s investigations into the 
underlying laws of correct reasoning. Since then they have become vital to electrical engi- 
neering, computer science, axiomatic set theory, model theory, and other areas of science 
and mathematics. We will return to them in Chapter IV. 


(11) HEyTING ALGEBRAS. An algebra (H,V,/A,—,0,1) with three binary and two nullary 
operations is a Heyting algebra if it satisfies: 


H1: (H,V,A) is a distributive lattice 

2. Bie RO; ey Leer, 

H3: x> x41 

H4: (c > y)AyRy;, eN\(a@>y)eady 

H5: t > (yAz) 8 (t@ oy) A(t > 2); (2@Vy) 92 (4 > Zz) Aly 2). 


These were introduced by Birkhoff under a different name, Brouwerian algebras, and with 
a different notation (v : u for u — v). 
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(12) n- VALUED PosT ALGEBRAS. An algebra (A, V,/A,’‘,0,1) with two binary, one unary, 
and two nullary operations is an n-valued Post algebra if it satisfies every identity satisfied 
by the algebra P,, = ({0,1,...,2 — 1},V,A,’,0,1) where ({0,1,...,27 —1},V,A,0,1) is a 
bounded chain with 0 << n-—1<n-2<---<2<1, andl’ = 2,2’ =3,...,(n—2)'= 
n—1,(n —1)/ = 0, and 0’ = 1. See Figure 8, where the unary operation ’ is depicted by 
arrows. In IV87 we will give a structure theorem for all n-valued Post algebras, and in V84 
show that they can be defined by a finite set of equations. 


n-1 


0 
Figure 8 The Post algebra P, 


(13) CYLINDRIC ALGEBRAS OF DIMENSION n. If we are given n € w, then an algebra 
(A, V,A,‘,C0,-++3€n—1,9, 1, doo, dor,.--,@n—1yn—1) With two binary operations, n + 1 unary 
operations, and n? + 2 nullary operations is a cylindric algebra of dimension n if it satisfies 
the following, where 0 <i,j,k <n: 


C1: (A, V,A,’,0,1) is a Boolean algebra 
C2: ¢0 +0 
C3: 2 < ox 
C4: o(a A cy) © (cx) A (cy) 
CO! Cee 0, Gx 
C6: dy = 1 
Cr: dz cid Nd) tts g Ak 
C8: c(dij At) A G(di; A 2’) %0 if t F 7. 
Cylindric algebras were introduced by Tarski and Thompson to provide an algebraic 
version of the predicate logic. 


(14) ORTHOLATTICES. An algebra (L,V,/,',0,1) with two binary, one unary and two 
nullary operations is an ortholattice if it satisfies: 

Ql: (L,V,A,0,1) is a bounded lattice 

VO ge ee ae Ve 
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Qs: (© Ay)! ea VyS (avy) eal Ay! 
Q4: (2') & z. 


An orthomodular lattice is an ortholattice which satisfies 


Q5: e<yourv(a' Ay) &y. 
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EXERCISES §1 


ie 


5. 


An algebra (A, F’) is the reduct of an algebra (A, F*) to F if F C F*, and F is the 
restriction of F* to F. Given n > 1, find equations © for semigroups such that © will 
hold in a semigroup (S,-) iff (S,-) is a reduct of a group (S,-,~1, 1) of exponent n (i.e., 
every element of S is such that its order divides n). 


Two elements a, b of a bounded lattice (L, V, A, 0,1) are complements if aVb = 1, aAb = 
0. In this case each of a,b is the complement of the other. A complemented lattice is a 
bounded lattice in which every element has a complement. 


(a) Show that in a bounded distributive lattice an element can have at most one 
complement. 


(b) Show that the class of complemented distributive lattices is precisely the class of 
reducts of Boolean algebras (to {V, A, 0, 1}). 


If (B,V,A,’,0,1) is a Boolean algebra and a,b € B, define a > b to be a’ V b. Show 
that (B,V,A,—,0,1) is a Heyting algebra. 


Show that every Boolean algebra is an ortholattice, but not conversely. 
(a) If (H,V,A,—,0,1) is a Heyting algebra and a,b € H show that a — b is the 
largest element c of H (in the lattice sense) such that aA c< b. 


(b) Show that the class of bounded distributive lattices (L,V,/A,0,1) such that for 
each a,b € L there is a largest c € L with aAc < b is precisely the class of reducts 
of Heyting algebras (to {V, A, 0, 1}). 
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(c) Show how one can construct a Heyting algebra from the open subsets of a topo- 
logical space. 


(d) Show that every finite distributive lattice is a reduct of a Heyting algebra. 


6. Let (M,-,1) be a monoid and suppose A C M. For a € A define fg : M — M 
by fa(s) = a-s. Show that the unary algebra (M, (fa)aca) satisfies fa, --+ fa, (uv) © 
fo ++ fo, (©) Uff a1-+-+Gn = b,-+-by. (This observation of Mal’cev [24] allows one to 
translate undecidability results about word problems for monoids into undecidability 
results about equations of unary algebras. This idea has been refined and developed 


by McNulty [1976] and Murskii [1971]). 


§2. Isomorphic Algebras, and Subalgebras 


The concepts of isomorphism in group theory, ring theory, and lattice theory are special 
cases of the notion of isomorphism between algebras. 


Definition 2.1. Let A and B be two algebras of the same type F. Then a function a: A > B 
is an tsomorphism from A to B if a is one-to-one and onto, and for every n-ary f € F, for 
a1,...,@4n € A, we have 


OF? (Gis ces VT” (Oasis): (x) 


We say A is isomorphic to B, written A = B, if there is an isomorphism from A to B. If a 
is an isomorphism from A to B we may simply say “a: A — B is an isomorphism”. 


As is well-known, following Felix Klein’s Erlanger Programm, algebra is often considered 
as the study of those properties of algebras which are invariant under isomorphism, and such 
properties are called algebraic properties. Thus from an algebraic point of view, isomorphic 
algebras can be regarded as equal or the same, as they would have the same algebraic 
structure, and would differ only in the nature of the elements; the phrase “they are equal up 
to isomorphism” is often used. 

There are several important methods of constructing new algebras from given ones. Three 
of the most fundamental are the formation of subalgebras, homomorphic images, and direct 
products. These will occupy us for the next few sections. 


Definition 2.2. Let A and B be two algebras of the same type. Then B is a subalgebra of 
A if B C A and every fundamental operation of B is the restriction of the corresponding 
operation of A, i.e., for each function symbol f, f? is f* restricted to B; we write simply 
B < A. A subuniverse of A is a subset B of A which is closed under the fundamental 
operations of A, i.e., if f is a fundamental n-ary operation of A and aj,...,a@, € B we 
would require f(a1,...,@n) € B. 
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Thus if B is a subalgebra of A, then B is a subuniverse of A. Note that the empty set 
may be a subuniverse, but it is not the underlying set of any subalgebra. If A has nullary 
operations then every subuniverse contains them as well. 

It is the above definition of subalgebra which motivated the choice of fundamental op- 
erations for the several examples given in $1. For example, we would like a subalgebra of 
a group to again be a group. If we were to consider a group as an algebra with only the 
usual binary operation then, unfortunately, subalgebra would only mean subsemigroup (for 
example the positive integers are a subsemigroup, but not a subgroup, of the group of all 
integers). Similar remarks apply to rings, modules, etc. By considering a suitable modifica- 
tion (enlargement) of the set of fundamental operations the concept of subalgebra as defined 
above coincides with the usual notion for the several examples in §1. 

A slight generalization of the notion of isomorphism leads to the following definition. 


Definition 2.3. Let A and B be of the same type. A function a: A — B is an embedding of 
A into B if a is one-to-one and satisfies (*) of 2.1 (such an a is also called a monomorphism). 
For brevity we simply say “a: A — B is an embedding”. We say A can be embedded in B 
if there is an embedding of A into B. 


Theorem 2.4. Ifa: A — B is an embedding, then a(A) is a subuniverse of B. 
PrRoor. Let a: A — B be an embedding. Then for an n-ary function symbol f and 


Q1,...,4n € A, 
fPB (aay, Unig Ej) af*(a1, ..+;n) € a(A), 


hence a(A) is a subuniverse of B. 


Definition 2.5. If a : A — B is an embedding, a(A) denotes the subalgebra of B with 
universe a(A). 


A problem of general interest to algebraists may be formulated as follows. Let K be 
a class of algebras and let Ky be a proper subclass of K. (In practice, AK may have been 
obtained from the process of abstraction of certain properties of Ky, or A, may be obtained 
from Kk by certain additional, more desirable, properties.) Two basic questions arise in the 
quest for structure theorems. 


(1) Is every member of K isomorphic to some member of Ky? 
(2) Is every member of kK embeddable in some member of Ky? 


For example, every Boolean algebra is isomorphic to a field of sets (see IV$1), every group 
is isomorphic to a group of permutations, a finite Abelian group is isomorphic to a direct 
product of cyclic groups, and a finite distributive lattice can be embedded in a power of the 
two-element distributive lattice. Structure theorems are certainly a major theme in Chapter 
LY, 
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83. Algebraic Lattices and Subuniverses 


We shall now describe one of the natural ways that algebraic lattices arise in universal 
algebra. 


Definition 3.1. Given an algebra A define, for every X C A, 


Sg(X) = ()\{B : X C B and B is a subuniverse of A}. 


We read Sg(X) as “the subuniverse generated by X”. 
Theorem 3.2. Jf we are given an algebra A, then Sg is an algebraic closure operator on A. 


PROOF. Observe that an arbitrary intersection of subuniverses of A is again a subuniverse, 
hence Sg is a closure operator on A whose closed sets are precisely the subuniverses of A. 
Now, for any X C A define 


E(X) =X U{f(a,...,@n) : f is a fundamental n-ary operation on A and ay,...,an € X}. 
Then define E”(X) for n > 0 by 


E(x) =x 
Bett Xx) = BCX). 


As all the fundamental operations on A are finitary and 
X CE(X)C E*(X)C.--- 
one can show that (Exercise 1) 
Se(X) = XUE(X)UEBE*(X)uU---, 


and from this it follows that if a € Sg(X) then a € E"(X) for some n < w; hence for some 
finite Y C X, ae E”(Y). Thus a € Sg(Y). But this says Sg is an algebraic closure operator. 


Corollary 3.3. [f A is an algebra then Ugg, the lattice of subuniverses of A, is an algebraic 
lattice. 


The corollary says that the subuniverses of A, with C as the partial order, form an 
algebraic lattice. 


Definition 3.4. Given an algebra A, Sub(A) denotes the set of subuniverses of A, and 
Sub(A) is the corresponding algebraic lattice, the lattice of subuniverses of A. For X C A 
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we say X generates A (or A is generated by X, or X is a set of generators of A) if Sg(X) = A. 
The algebra A is finitely generated if it has a finite set of generators. 


One cannot hope to find any further essentially new lattice properties which hold for the 
class of lattices of subuniverses since every algebraic lattice is isomorphic to the lattice of 
subuniverses of some algebra. 


Theorem 3.5 (Birkhoff and Frink). Jf L is an algebraic lattice, then L = Sub(A), for some 
algebra A. 


Proor. Let C be an algebraic closure operator on a set A such that L ~ Lo (such exists 
by 185.8). For each finite subset B of A and each b € C(B) define an n-ary function fg, on 
A, where n = |B|, by 


by SB tara cia} 


a, otherwise, 


feo(ai, wey An) = 


and call the resulting algebra A. Then clearly 


Fa pl Gigiens Ga) 6 OC {Gi i54.Gn}); 


hence for X C A, 
Se(X) C C(X). 


On the other hand 
CX) = L{ce) : BCX and B is finite} 


and, for B finite, 


COB) =4 fas Gisee0h 0) SB = aid hs. CEB )} 
Cc Sg(B) 
Cc Se(X) 


imply 


hence 


Thus Lo = Sub(A), so Sub(A) = L. 


The following set-theoretic result is used to justify the possibility of certain constructions 
in universal algebra—in particular it shows that for a given type there cannot be “too many” 
algebras (up to isomorphism) generated by sets no larger than a given cardinality. Recall 
that w is the smallest infinite cardinal. 
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Corollary 3.6. If A is an algebra and X C A then |Sg(X)| < |X|+ |F]+w. 


ProoFr. Using induction on n one has |E"(X)| < |X| + |F| + w, so the result follows from 
the proof of 3.2. 


REFERENCE 
1. G. Birkhoff and O. Frink [1948] 


EXERCISE §3 
1. Show Sg(X) = XU E(X)U E?(X)U--- 


84. The Irredundant Basis Theorem 


Recall that finitely generated vector spaces have the property that all minimal generating 
sets have the same cardinality. It is a rather rare phenomenon, though, to have a “dimen- 
sion.” For example, consider the Abelian group Zg—it has both {1} and {2,3} as minimal 
generating sets. 


Definition 4.1. Let C' be a closure operator on A. For n < w, let C,, be the function defined 
on Su(A) by 
Cn(X) =|(J{C(Y) : ¥ © X,|¥| <n}. 


We say that C' is n-ary if 


where 


Lemma 4.2. Let A be an algebra all of whose fundamental operations have arity at most 
n. Then Sg is an n-ary closure operator on A. 


ProoF. Note that (using the F of the proof of 3.2) 
E(X) © (Sg)n(X) S Se(X); 
hence 
Se(X) =X UE(X)UE*(X)u--- 
€ (Sg)n(X) U (Se)a(X) U > 
C Sg(X), 


sO 
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Sg(X) = (Sg)n(X) U (Sg)n(X) U--+. 


Definition 4.3. Suppose C is a closure operator on S. A minimal generating set of S' is called 
an irredundant basis. Let IrB(C) = {n < w: S has an irredundant basis of n elements}. 


The next result shows that the length of the finite gaps in IrB(C) is bounded by n — 2 if 
Cis an n-ary closure operator. 


Theorem 4.4 (Tarski). [fC is an n-ary closure operator on S' with n > 2, and ifi <j with 
i,j € IrB(C) such that 


{i+1,...,j -1}NEB(C) =@, (x) 


then j-i <n-—1. In particular, ifn = 2 then IrB(C) is a convex subset of w, i.e., a sequence 
of consecutive numbers. 


Proor. Let B be an irredundant basis with |B| = 7. Let Kk be the set of irredundant bases 
A with |A| <7. 

The idea of the proof is simple. We will think of B as the center of S, and measure the 
distance from B using the “rings” C*t!(B) — C*(B). We want to choose a basis Ag in K 
such that Ap is as close as possible to B, and such that the last ring which contains elements 
of Ap contains as few elements of Ag as possible. We choose one of the latter elements ag and 
replace it by n or fewer closer elements }),...,6;, to obtain a new generating set A,, with 
|Ai| <i+n. Then A; contains an irredundant basis Ay. By the ‘minimal distance’ condition 
on Ag we see that Ay ¢ K, hence |Ag| > i, so |A2| > 7 by (*). Thus 7 <i+n. 

Now for the details of this proof, choose Ag € K such that 


Ay £ C*(B) imples A ¢ C*(B) 
for A € K (see Figure 9). Let t be such that 

Ay © Cr*'(B), Ao EC, (B). 
We can assume that 

|Ao 1M (Cr**(B) — C7(B))| < |AN (Cr*"(B) — C2 (B))| 

for all A € K with A C Ctt!(B). Choose 

ay € [C*t!(B) — C*(B)]M Ap. 
Then there must exist b1,...,0m € C!(B), for some m <n, with 


ado € Cr({b1, pe hes bm}), 
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6) 
Ag € C,,(A1), 
where 
Ay => (Ao = {ao}) U {b1, Peer nk 
hence 


C(Ao) © C(Ai), 


which says A, is a set of generators of S. Consequently, there is an irredundant basis Ag C Aj. 
Now |Ag| < |Ao| +7. If |Ao| + 2 < j, we see that the existence of A» contradicts the choice 
of Ag as then we would have 


Ay € K, Ae OE) 


and 
|A2n (Cr**(B) — C,(B))| < [Ao 9 (Cp*7(B) — CL(B))]- 


Thus |Ao| +7 > 7. As |Ao| < i, we have j —i <n. 


Figure 9 
EXAMPLE. If A is an algebra all of whose fundamental operations have arity not exceeding 
2 then IrB(Sg) is a convex set. This applies to all the examples given in §1. 
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EXERCISES §4 


1. Find IrB(Sg), where Sg is the subuniverse closure operator on the group of integers Z. 


2. If C is aclosure operator on a set S and X is a closed subset of S, show that 4.4 applies 
to the irredundant bases of X. 


3. If A is a unary algebra show that |IrB(Sg)| < 1. 


4. Give an example of an algebra A such that IrB(Sg) is not convex. 


85. Congruences and Quotient Algebras 


The concepts of congruence, quotient algebra, and homomorphism are all closely related. 
These will be the subjects of this and the next section. 

Normal subgroups, which were introduced by Galois at the beginning of the last century, 
play a fundamental role in defining quotient groups and in the so-called homomorphism and 
isomorphism theorems which are so basic to the general development of group theory. Ideals, 
introduced in the second half of the last century by Dedekind, play an analogous role in 
defining quotient rings, and in the corresponding homomorphism and isomorphism theorems 
in ring theory. Given such a parallel situation, it was inevitable that mathematicians should 
seek a general common formulation. In these two sections the reader will see that congruences 
do indeed form the unifying concept, and furthermore they provide another meeting place 
for lattice theory and universal algebra. 


Definition 5.1. Let A be an algebra of type F and let 6 € Eq(A). Then @ is a congruence 
on A if @ satisfies the following compatibility property: 


CP: For each n-ary function symbol f € F and elements a;,b; € A, if a;0b; holds for 
1<i<nthen 
FP Gisec Gn OF Wig a) 
holds. 


The compatibility property is an obvious condition for introducing an algebraic structure 
on the set of equivalence classes A/@, an algebraic structure which is inherited from the 
algebra A. For if a,,...,@, are elements of A and f is an n-ary symbol in F, then the easiest 
choice of an equivalence class to be the value of f applied to (a,/0,...,@,/0) would be simply 
fA(a4,-.-,@n)/0. This will indeed define a function on A/@ iff (CP) holds. We illustrate (CP) 
for a binary operation in Figure 10 by subdividing A into the equivalence classes of 0; then 
selecting a1, b; in the same equivalence class and dg, bo in the same equivalence class we want 
fA(a1, 61), f4(a2, bz) to be in the same equivalence class. 
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f*(a,-b) ¢, 


if *(ay>by) . 


Figure 10 


Definition 5.2. The set of all congruences on an algebra A is denoted by Con A. Let 0 
be a congruence on an algebra A. Then the quotient algebra of A by 0, written A/6, is the 
algebra whose universe is A/@ and whose fundamental operations satisfy 


fA (a,/0, aed , an/) = fr(a, et -;Qn)/0 
where aj,...,@, € A and f is an n-ary function symbol in F. 
Note that quotient algebras of A are of the same type as A. 


EXAMPLES. (1) Let G be a group. Then one can establish the following connection between 
congruences on G and normal subgroups of G: 

(a) If 0 € Con G then 1/0 is the universe of a normal subgroup of G, and for a,b € G 
we have (a,b) € 0 iffa-b-! € 1/6; 

(b) If N is a normal subgroup of G, then the binary relation defined on G by 


(a,b)€O iff a-b'teNn 


is a congruence on G with 1/0 = N. 

Thus the mapping 0 + 1/0 is an order-preserving bijection between congruences on G 
and normal subgroups of G. 

(2) Let R be aring. The following establishes a similar connection between the congru- 
ences on R and ideals of R: 

(a) If @ € Con R then 0/6 is an ideal of R, and for a,b € R we have (a,b) € 6 iff 
a—b€e0/6; 

(b) If J is an ideal of R then the binary relation @ defined on R by 


(a,b) € @iffa-—beTl 


40 II The Elements of Universal Algebra 


is a congruence on R with 0/6 = I. 
Thus the mapping 0 + 0/0 is an order-preserving bijection between congruences on R 
and ideals of R. 


These two examples are a bit misleading in that they suggest any congruence on an 
algebra might be determined by a single equivalence class of the congruence. The next 
example shows this need not be the case. 


(3) Let L be a lattice which is a chain, and let 6 be an equivalence relation on L such 
that the equivalence classes of @ are convex subsets of L (i.e., if a@@b and a < c < b then afc). 
Then @ is a congruence on L. 


We will delay further discussion of quotient algebras until the next section and instead 
concentrate now on the lattice structure of Con A. 


Theorem 5.3. (Con A, C) is a complete sublattice of (Eq(A), C), the lattice of equivalence 
relations on A. 


PROOF. To verify that Con A is closed under arbitrary intersection is straightforward. For 
arbitrary joins in Con A suppose 6; € Con A for i € J. Then, if f is a fundamental n-ary 
operation of A and 
(a1, bi), ee) (Qa bis) € VV O; 
iel 
where \/ is the join of Eq(A), then from [84.7 it follows that one can find ip,..., 7, € J such 
that 
(ai, bi) € Bi, 0 Oi, 0 -+ 0 8;,, O<i<n. 


An easy argument then suffices to show that 


UP (Giyss5 Ga) if Wises a) SU 0 0, Os +e Gis 


hence Vex 70: iS a congruence relation on A. 


Definition 5.4. The congruence lattice of A, denoted by Con A, is the lattice whose 
universe is Con A, and meets and joins are calculated the same as when working with 
equivalence relations (see I§4). 


The following theorem suggests the abstract characterization of congruence lattices of 
algebras. 


Theorem 5.5. For A an algebra, there is an algebraic closure operator 0 on A x A such 
that the closed subsets of A x A are precisely the congruences on A. Hence Con A is an 
algebraic lattice. 


85. Congruences and Quotient Algebras 4l 


PRooF. Let us start by setting up an appropriate algebraic structure on A x A. First, for 
each n-ary function symbol f in the type of A let us define a corresponding n-ary function 
f on Ax A by 


PC Gp Deeg Oe be PST iia ieee 2 ese Oa) 


Then we add the nullary operations (a,a) for each a € A, a unary operation s defined by 


5((a, b)) = (b, a), 


and a binary operation t defined by 


(ao0)s Ib b= 
tte). (6.d)) = 
(a 8), (ed) fe b) otherwise. 
Now it is an interesting exercise to verify that B is a subuniverse of this new algebra iff B is 
a congruence on A. Let © be the Sg closure operator on A x A for the algebra we have just 
described. Thus, by 3.3, Con A is an algebraic lattice. 


The compact members of Con A are, by 185.7, the finitely generated members O((aj, 5), 
w+ +3 (An, bp)) of Con A. 


Definition 5.6. For A an algebra and ay,...,a, € A let O(a1,...,a,) denote the congruence 
generated by {(a;,a;) :1< i,j <n}, ie., the smallest congruence such that a1,...,@, are in 
the same equivalence class. The congruence O(a,, a2) is called a principal congruence. For 
arbitrary X C A, let O(X) be defined to mean the congruence generated by X xX. 


Finitely generated congruences will play a key role in II§12, in Chapter IV, and Chapter 
V. In certain cases we already know a good description of principal congruences. 


EXAMPLES. (1) If G is a group and a,b,c,d € G then (a,b) € O(c, d) iff ab~' is a product 
of conjugates of cd~' and conjugates of de~!. This follows from the fact that the smallest 
normal subgroup of G containing a given element e has as its universe the set of all products 
of conjugates of e and conjugates of e~!. 

(2) If R is a ring with unity and a,b,c,d € R then (a,b) € O(c, d) iff a — b is of the form 
Voicjen Tile — d)s; where r;,5; € R. This follows from the fact that the smallest ideal of R 


containing a given element e of R is precisely the set pa res; 7,5 € Ryn> 1} : 


Some useful facts about congruences which depend primarily on the fact that © is an 
algebraic closure operator are given in the following. 


Theorem 5.7. Let A be an algebra, and suppose ay, b1,...,Qn,b, € A and @ € Con A. 
Then 
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a, b1) = O(b1, a1) 
(a1, 61),---, (Gn, bn)) = O(a, bt) V +++ V O(an, bn) 


—_— 


(bi, a) — O(a, by) 


O(bi, a1) C O(a, br); 


hence, by symmetry, 
O(a, by) _ O(bi, a1). 


(b) For 1 <i<n, 
(Gi, bi) € O((a1, bi), re) (das bn) )3 


hence 


O(ai, b;) e O((a1, bi), esate (Osis is) )3 


sO 
O(a, by) Vere V Oias bs) G O((a4, b1), atnsttrly (aig Da): 


On the other hand, for 1 <i <n, 


(ai, b;) — O(ai, b;) ie O(a, by) Vere V O(dn, EAP 


so 

{(a1, b1),...; (dn, bn) } C O(ai, b1) V +++ V O(an, bn); 
hence 

O((a1, b1),---; (An, bn)) © O(ai, b1) V-+- V O(an, bn), 
so 

O((ai, b1),.--, (Qn; On)) = Oar, b1) V--- V O(an, bn). 

(c) For 1 <i<n-1, 
(aj, @i41) € O(a4,..., An), 
so 
O(a;, @i41) C O(a1,..-, Gn); 

hence 


O(a1, a2) ass Ol Gn teGa) S O(a, ts sf Og) 


Conversely, for 1 <<i<j <n, 


(ai, a3) € O(aj, ai41) 0+ ++ 0 O(aj-1, a5) 
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so, by 184.7 
(ai, a;) — O(aj, Qi41) Vere V O(a;-1, Gig); 


hence 
(a;,a;) € O(a1, a2) V +++ V O(an-1, An). 


In view of (a) this leads to 


O(a1,..-,@n) © O(a1, a2) V-++ V O(an-1, Gn), 


O(a1,.--,4n) = O(a1, a2) V--- V O(dn_1, Gn). 
(d) For (a,b) € 6 clearly 
(a,b) € O(a, b) CO 
6 | J{O(a,b) : (a,b) € 0} C \/{O(a, b) : (a,b) € 0} CO: 
hence 


6 =|_J{9(a,b) : (a,b) € 0} = \/{O(a, b) : (a,b) € Of. 
(e) (Similar to (d).) 


One cannot hope for a further sharpening of the abstract characterization of congruence 
lattices of algebras in 5.5 because in 1963 Gratzer and Schmidt proved that for every algebraic 
lattice L there is an algebra A such that L = Con A. Of course, for particular classes of 
algebras one might find that some additional properties hold for the corresponding classes of 
congruence lattices. For example, the congruence lattices of lattices satisfy the distributive 
law, and the congruence lattices of groups (or rings) satisfy the modular law. One of the 
major themes of universal algebra has been to study the consequences of special assumptions 
about the congruence lattices (or congruences) of algebras (see $12 as well as Chapters IV 
and V). For this purpose we introduce the following terminology. 


Definition 5.8. An algebra A is congruence-distributive (congruence-modular) if Con A is 
a distributive (modular) lattice. If 6,02 € Con A and 


6, 069 = 0200, 


then we say @; and 6) are permutable, or 6; and 0) permute. A is congruence-permutable if 
every pair of congruences on A permutes. A class K of algebras is congruence-distributive, 
congruence-modular, respectively congruence-permutable iff every algebra in K has the de- 
sired property. 


We have already looked at distributivity and modularity, so we will finish this section 
with two results on permutable congruences. 
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Theorem 5.9. Let A be an algebra and suppose 6;,02 € Con A. Then the following are 
equivalent: 

(a) 9, 00g = 0,004 

(b) 6, V Oo = 01 0 2 

(c) 0, 0 A C A» 0 64. 


PROOF. (a) = (b): For any equivalence relation 6 we have 0 o @ = @, so from (a) it follows 
that the expression for 0; V 62 given in 184.6 reduces to 6; U (A; © 82), and hence to 6; © 03. 
(c) = (a): Given (c) we have to show that 


A 0 Oy CO, 0 Oy. 


This, however, follows easily from applying the relational inverse operation to (c), namely 
we have 


(01 0 2) € (82 0 1), 
and hence (as the reader can easily verify) 
Az 0 Oy C Oyo As. 


Since the inverse of an equivalence relation is just that equivalence relation, we have estab- 
lished (a). 
(b) = (c): Since 
8206, C A V Oo, 
from (b) we could deduce 
9206, CO, 0 A, 


and then from the previous paragraph it would follow that 


9200; = 0; 0 Ag; 


hence (c) holds. 


Theorem 5.10 (Birkhoff). [f A is congruence-permutable, then A is congruence-modular. 
PROOF. Let 61,602,083; € Con A with 6; C 09. We want to show that 
62.7 (A: V 63) C 61 V (821M 63), 
so suppose (a,b) is in 02M (6; V 03). By 5.9 there is an element c such that 
a6,c 3b 


holds as 
6, V 3 = 01 0 83. 
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By symmetry 


(c,a) € 04; 
hence 
(c,a) € Oo, 
and then by transitivity 
(c, b) € A. 
Thus 
Ce; b) € Ay N 03, 
so from 
aO,c(O2 M" 03)b 
follows 
(a, b) € 6,0 (02 NM 03); 
hence 


(a, b) C 0, V (O02 al 03). 


We would like to note that in 1953 Jonsson improved on Birkhoff’s result above by 
showing that one could derive the so-called Arguesian identity for lattices from congruence- 
permutability. In §12 we will concern ourselves again with congruence-distributivity and 
permutability. 


REFERENCES 


1. G. Birkhoff [3] 

2. G. Gratzer and E.T. Schmidt [1963] 
3. B. Jénsson [1953] 

4. P. Pudlak [1976] 


EXERCISES 85 


1. Verify the connection between normal subgroups and congruences on a group stated 
in Example 1 (after 5.2). 


2. Verify the connection between ideals and congruences on rings stated in Example 2 
(after 5.2). 


3. Show that the normal subgroups of a group form an algebraic lattice which is modular. 


4. Show that every group and ring is congruence-permutable, but not necessarily congruence- 
distributive. 
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5. Show that every lattice is congruence-distributive, but not necessarily congruence- 
permutable. 


6. In the proof of 5.5, verify that subuniverses of the new algebra are precisely the con- 
gruences on A. 


7. Show that © is a 2-ary closure operator. [Hint: replace each n-ary f of A by unary 
operations 


f(a, oe +, Qj-1, 0%, Aj41,---, Gigs) Q1,--+,QAj-1, Aj41,---,An E A 
and show this gives a unary algebra with the same congruences. | 


8. If A is a unary algebra and B is a subuniverse define @ by (a,b) € 6 iff a = b or 
{a,b} C B. Show that 6 is a congruence on A. 


9. Let S be a semilattice. Define a < b for a,b € S if a-b =a. Show that < is a partial 
order on S. Next, given a € S' define 


6, = {(b,c) € S x S: both or neither of a < b,a < c hold}. 


Show 6, is a congruence on S. 


An algebra A has the congruence extension property (CEP) if for every B < A and 
6 € Con B there is a ¢ € Con A such that 6 = 6M B?. A class K of algebras has the CEP 
if every algebra in the class has the CEP. 


10. Show that the class of Abelian groups has the CEP. Does the class of lattices have the 
CEP? 


11. If L is a distributive lattice and a,b,c,d € L show that (a,b) € O(c, d) iff eAdAa= 
cAdAbandcVdVa=cVdvb. 


An algebra A has 3-permutable congruences if for all 0,¢ € Con A we have 60 g¢06 C 
bo8od. 


12. (Jonsson) Show that if A has 3-permutable congruences then A is congruence-modular. 
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86. Homomorphisms and the Homomorphism and 
Isomorphism Theorems 


Homomorphisms are a natural generalization of the concept of isomorphism, and, as we shall 
see, go hand in hand with congruences. 


Definition 6.1. Suppose A and B are two algebras of the same type F. A mapping a : 
A — B is called a homomorphism from A to B if 


Qt a a Sf Bas SOG) 


for each n-ary f in F and each sequence ay,...,@, from A. If, in addition, the mapping a is 
onto then B is said to be a homomorphic image of A, and a is called an epimorphism. (In 
this terminology an isomorphism is a homomorphism which is one-to-one and onto.) In case 
A = Ba homomorphism is also called an endomorphism and an isomorphism is referred to 
as an automorphism. The phrase “a : A — B is a homomorphism” is often used to express 
the fact that a@ is a homomorphism from A to B. 


EXAMPLES. Lattice, group, ring, module, and monoid homomorphisms are all special cases 
of homomorphisms as defined above. 


Theorem 6.2. Let A be an algebra generated by a set X. Ifa: A— Band @: A — B are 
two homomorphisms which agree on X (i.e., a(a) = G(a) fora € X), thena=Q8. 


PRooF. Recall the definition of EF in §3. Note that if a and @ agree on X then a and ( 


agree on E(X), for if f is an n-ary function symbol and aj,...,a, € X then 
Gf Gijsiy Ga) HF (GG ie.teG,) 
=f Baines Cas) 
= BF (a, hut Gn): 


Thus by induction, if ~@ and @ agree on X then they agree on E”(X) for n < w, and hence 
they agree on Sg(X). 


Theorem 6.3. Let a: A — B be a homomorphism. Then the image of a subuniverse of A 
under a is a subuniverse of B, and the inverse image of a subuniverse of B is a subuniverse 
of A. 


PRooF. Let S be a subuniverse of A, let f be an n-ary member of J, and let aj,...,a, € S. 
Then 
fP(aa1,...,00n) = af*(a1,...,an) € a(S), 
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so a(S’) is a subuniverse of B. If we now assume that S is a subuniverse of B (instead of 
A) and a(a;),...,a(a,) € S then af*(a,,...,a,) € S follows from the above equation, so 
fA(ay,..-,@n) is in a~1(S). Thus a~1($) is a subuniverse of A. 


Definition 6.4. If a : A — Bis a homomorphism and C < A,D < B, let a(C) be the 
subalgebra of B with universe a(C), and let a~!(D) be the subalgebra of A with universe 
a~1(D), provided a~(D) 4 2. 


Theorem 6.5. Supposea : A — B and 8 : B — C are homomorphisms. Then the 
composition Boa is a homomorphism from A to C. 


ProorF. For f an n-ary function symbol and ay1,...,a) € A, we have 


(Go a) f*(a1, Ley Qn) = Blaf*(ar, ~++7Qn)) 
= BFP (aa,...,AAn) 
= f° (B(aa1),..., (aan) 
= f°((Boa)a,..., (G0 a)an). 


The next result says that homomorphisms commute with subuniverse closure operators. 
Theorem 6.6. Ifa: A —B is a homomorphism and X is a subset of A then 


a Sg(X) = Sg(aX). 


ProoF. From the definition of F (see §3) and the fact that a is a homomorphism we have 
ak(Y) = E(aY) 
for all Y C A. Thus, by induction on n, 
ak”"(X) = E"(aX) 
for n > 1; hence 


a 8g(X)=a(X U E(X)U F(X) vu...) 
= aX Uak(X)Uak?(x 
= aX UE(aX)U E?(aX 
= Sg(aX). 


ee 
Una! 
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Definition 6.7. Let a: A — B bea homomorphism. Then the kernel of a, written ker(a), 
is defined by 
ker(a) = {(a,b) € A? : a(a) = a(b)}. 


Theorem 6.8. Let a: A — B be a homomorphism. Then ker(a) is a congruence on A. 


ProoF. If (a;,b;) € ker(a) for 1 <i<nand f is n-ary in F, then 


af*(a1,-.-,@n) = fP(aai,..., an) 
= fP (aby, ..., abn) 
= OF (ieee DN): 


hence 
(f*(a1,-.-,@n), f*(b1,.--,bn)) € ker(a). 


Clearly ker(@) is an equivalence relation, so it follows that ker(a@) is actually a congruence 
on A. 


When studying groups it is usual to refer to the kernel of a homomorphism as a normal 
subgroup, namely the inverse image of the identity element under the homomorphism. This 
does not cause any real problems since we have already pointed out in 85 that a congruence 
on a group is determined by the equivalence class of the identity element, which is a normal 
subgroup. Similarly, in the study of rings one refers to the kernel of a homomorphism as a 
certain ideal. 

We are now ready to look at the straightforward generalizations to abstract algebras of 
the homomorphism and isomorphism theorems usually encountered in a first course on group 
theory. 


ELL 
LIL IL] 
y LILI 


A/® 


Figure 11 
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Definition 6.9. Let A be an algebra and let 6 € Con A. The natural map 1 : A > A/O 
is defined by vg(a) = a/0. (When there is no ambiguity we write simply v instead of 1.) 
Figure 11 shows how one might visualize the natural map. 

Theorem 6.10. The natural map from an algebra to a quotient of the algebra is an onto 
homomorphism. 


Proor. Let 6 € Con A and let vy : A — A/@ be the natural map. Then for f an n-ary 
function symbol and ay,...,an) € A we have 


Ut (Gist in SF (aise ccgan) 0 
= f*"(a,/0,...,an/8) 


= fA)" (vax, sical), 


so v is a homomorphism. Clearly v is onto. 


Definition 6.11. The natural homomorphism from an algebra to a quotient of the algebra 
is given by the natural map. 


Theorem 6.12 (Homomorphism Theorem). Suppose a: A — B is a homomorphism onto 
B. Then there is an isomorphism 3 from A/ker(a) to B defined by a = Gov, where v is 
the natural homomorphism from A to A/ker(a). (See Figure 12). 
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A/ker a 
Figure 12 
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ProoF. First note that if a = Gov then we must have G(a/@) = a(a). The second of these 
equalities does indeed define a function 3, and 6G satisfies a = Gov. It is not difficult to 
verify that @ is a bijection. To show that @ is actually an isomorphism, suppose f is an 


n-ary function symbol and aj,...,a@, € A. Then 
B(fA/9(a1/9, ais , an/0)) a B(fA(a, eae) An) /) 
Of ats canta) 
= fP (aa, mee Qn) 


= fP(B(a1/9),..., B(an/8)). 


Combining Theorems 6.5 and 6.12 we see that an algebra is a homomorphic image of 
an algebra A iff it is isomorphic to a quotient of the algebra A. Thus the “external” prob- 
lem of finding all homomorphic images of A reduces to the “internal” problem of finding 
all congruences on A. The homomorphism theorem is also called “the first isomorphism 
theorem” . 


Definition 6.13. Suppose A is an algebra and ¢@,4@ € Con A with 6 C ¢. Then let 
0/0 = {(a/0,b/0) € (A/8)? : (a,b) € o}. 


Lemma 6.14. /f¢,9 € Con A and @ C 4@, then @/0 is a congruence on A/O. 
ProoF. Let f be an n-ary function symbol and suppose (a;/6, b;/0) € 6/0, 1 <i <n. Then 
(a;,;) € @ (why?), so 
(f*(a1,---,@n), f*(h1,---;bn)) € 4, 
and thus 
(fA (a1,.--,@n)/0, f*(b1,..-,6n)/0) € 6/8. 


From this is follows that 


(f4/9(a1/0,...,Gn/0), FA! (b1/8,...,bn/O) € /0. 


Theorem 6.15 (Second Isomorphism Theorem). /f ¢,6 € Con A and 0 C ¢, then the map 
a: (A/0)/(¢/8@) > A/é 


defined by 
a((a/6)/(¢/0)) = a/¢ 
is an isomorphism from (A/0)/(¢/0) to A/@. (See Figure 13.) 
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of 6/0 
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(A/0) / (0/8) 
Figure 13 


PROOF. Let a,b € A. Then from 


(a/)/(e/8) = (0/0)/(6/0) iff a/p = 6/6 


it follows that a is a well-defined bijection. Now, for f an n-ary function symbol and 
(1,...,@4, € A we have 


af AOE! (a, /0)/(6/8), .- ., (an/9)/(/8)) = a f 4!" (21/9, ...,an/9)/(¢/8)) 
= a((f*(a1,...,@n)/0)/(6/0)) 
= f(a: SEO 

Sf" (Gi] 0 ).aisy0n/ D) 


= f*!9(a((a1/)/(G/9)), ---,0((an/9)/(9/9))), 


so a is an isomorphism. 


Definition 6.16. Suppose B is a subset of A and @ is a congruence on A. Let B® = {a € 
A: Bona/@ 4 }. Let B® be the subalgebra of A generated by B®. Also define @[g to be 


69 B?, the restriction of 0 to B. (See Figure 14, where the dashed-line subdivisions of A are 
the equivalence classes of 0.) 
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Figure 14 


Lemma 6.17. /f B is a subalgebra of A and @ € Con A, then 


(a) The universe of B® is B®. 
(b) Of g is a congruence on B. 


PRooF. Suppose f is an n-ary function symbol. For (a) let a,,...,a@, € B®. Then one can 
find b,,...,b, € B such that 
(a;, b;) € O, l<i<n, 
hence 
CF Gixe Gn) of  Oigseay On) e Us 
SO 


fA(at, 1 +5Qn) € B®. 


Thus B® is a subuniverse of A. Next, to verify that 0], is a congruence on B is straightfor- 
ward. 


Theorem 6.18 (Third Isomorphism Theorem). [f B is a subalgebra of A and @ € Con A, 
then (see Figure 15) 
B/Ol a= B’/O[ po . 


OOo OO OouoUuouo 
id ee 


(a: Te <a Pay TE els [el 
a SiS Memes 


B/ ols B®/6hg0 
Figure 15 
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PROOF. We leave it to the reader to verify that the map a defined by a(b/@[g) = 6/0} pe is 
the desired isomorphism. 


The last theorem in this section will be quite important in the subsequent study of 
subdirectly irreducible algebras. Before looking at this theorem let us note that if L is a 
lattice and a,b € L with a < 6 then the interval a, b] is a subuniverse of L. 


Definition 6.19. For [a,b] a closed interval of a lattice L, where a < 6, let [a,b] denote the 
corresponding sublattice of L. 


Theorem 6.20 (Correspondence Theorem). Let A be an algebra and let 0 € Con A. Then 
the mapping a defined on [6, V4] by 


a(¢) = 6/6 


is a lattice isomorphism from [0,V 4] to Con A/6, where [0, V4] is a sublattice of Con A. 
(See Figure 16.) 


ConA 
Figure 16 


PRooF. To see that a is one-to-one, let ¢,# € [0, V4] with ¢ 4 w. Then, without loss of 
generality, we can assume that there are elements a,b € A with (a,b) € 6 —w. Thus 


(a/6, b/@) € (~/8) — (w/8), 
so 
ale) # aly). 
To show that a is onto, let ~ € Con A/@ and define ¢ to be ker(vy,1). Then for a,b € A, 
(a/0,b/0) € 6/0 
iff (a,b) Eg 
iff (a/0,b/0) € ¥, 
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so 
o/0 = w. 
Finally, we will show that a is an isomorphism. If ¢, ¢ € [@, V4] then it is clear that 


ecw 
iff 9/8 y/O 
iff ad Caw. 


One can readily translate 6.12, 6.15, 6.18, and 6.20 into the (usual) theorems used in 
group theory and in ring theory. 


EXERCISES 86 


1. Show that, under composition, the endomorphisms of an algebra form a monoid, and 
the automorphisms form a group. 


2. Translate the isomorphism theorems and the correspondence theorem into results about 
groups [rings], replacing congruences by normal subgroups [ideals]. 


3. Show that a homomorphism a is an embedding iff ker a = A. 


4. If 9 € Con A and Con A is a modular [distributive] lattice then show Con A/6 is 
also a modular [distributive] lattice. 


5. Let a : A — B be a homomorphism, and X C A. Show that (a,b) € O(X) => 
(aa, ab) € O(aX). 


6. Given two homomorphisms a: A — B and 3: A — C, if ker @ C ker a and @ is onto, 
show that there is a homomorphism y : C > B such that a = yo (3. 


87. Direct Products, Factor Congruences, and 
Directly Indecomposable Algebras 
The constructions we have looked at so far, namely subalgebras and quotient algebras, do 


not give a means of creating algebras of larger cardinality than what we start with, or of 
combining several algebras into one. 
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Definition 7.1. Let A; and Ag be two algebras of the same type F. Define the (direct) 
product A; x A» to be the algebra whose universe is the set A; x A», and such that for 
f © F, and a; € Aj, a, € Ag, l<i<n, 


fA GiGi eng AO Gy) =A PGi cies Gn (Apahsa dy) 


In general neither A; nor A» is embeddable in A; x Ag, although in special cases like 
groups this is possible because there is always a trivial subalgebra. However, both A; and 
Ay» are homomorphic images of A; x Ag. 


Definition 7.2. The mapping 
™,: Ay X Ap > Aj, te {1,2}; 


defined by 
T((A1, A2)) = Gi, 


is called the projection map on the ith coordinate of A, x Ag. 


Theorem 7.3. Fori = 1 or 2 the mapping 7; : A, x Ag — A; is a surjective homomorphism 
from A = A, x Ag to Aj. Furthermore, in Con A; x Ay we have 


ker 7 O ker m2 = A, 


ker 7, and ker 1. permute, 


and 
ker 71 V ker. = V. 


PROOF. Clearly 7; is surjective. If f € F, and a; € Aj,a, € Ao, 1 <i<n, then 


mi(f*((a1, a4), eee) (Gaya) = mi((f"* (a1, ars 1 On), (Gee Deans ,d;,))) 
Seta) 


= {™ (m1 ((a1, ai)), ae ™((n, Gi 


so 7 is a homomorphism; and similarly 72 is a homomorphism. 


Now 
((a1, @2), (bi, b2)) € ker 7; 
iff Ti ({@1, @2)) = Ti({d1, be) ) 
Thus 


ker 7, MN ker 72 = A. 
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Also if (a1, a2), (b1, b2) are any two elements of A; x Ag then 
(a1, @2) ker 7 (a1, bz) ker 7(b1, be), 


sO 
V =ker7 0 ker 7p. 


But then ker 7, and ker 72 permute, and their join is V. 


The last half of Theorem 7.3 motivates the following definition. 


Definition 7.4. A congruence @ on A is a factor congruence if there is a congruence 6* on 
A such that 


One =A, 
OVOT=V, 


and 
0 permutes with 6”. 


The pair 6, 6* is called a pair of factor congruences on A. 
Theorem 7.5. [f0,0* is a pair of factor congruences on A, then 
A=A/6Ox A/#* 


under the map 


a(a) = (a/0,a/6*). 


ProoF. If a,b € A and 


a(a) = a(b) 
then 
a/@ = b/0 and a/@ = b/0", 
so 
(a,b) € 6 and (a,b) € 0; 
hence 


a= b. 


This means that a is injective. Next, given a,b € A there isace€ A with 


abcO"b ; 
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hence 


a(c) = (c/8,¢/6") 
= (a/0,b/0"), 


so a is onto. Finally, for f € F, and aj,...,a, € A, 


Cor ee SP (a) Of a dO 
= (f4/9(a,/0,...,n/0), FA! (a, /0",...,an/8")) 
= fA CanslO, GO) ats 4 OSa Uy) 


= fA/xA/ (vay, ..., dn); 


hence a is indeed an isomorphism. 
Thus we see that factor congruences come from and give rise to direct products. 


Definition 7.6. An algebra A is (directly) indecomposable if A is not isomorphic to a direct 
product of two nontrivial algebras. 


EXAMPLE. Any finite algebra A with |A| a prime number must be directly indecomposable. 
From Theorems 7.3 and 7.5 we have the following. 


Corollary 7.7. A is directly indecomposable iff the only factor congruences on A are A 
and V. 


We can easily generalize the definition of A; x A» as follows. 


Definition 7.8. Let (A,)jc; be an indexed family of algebras of type ¥. The (direct) product 
A = [],<, Ai is an algebra with universe [[,-, A; and such that for f € F, and aj,...,an € 
ie] 47) 
f Cin eGa) =f" a@se,40@) 
for i € I, ie., f* is defined coordinate-wise. The empty product |] @ is the trivial algebra 
with universe {@}. As before we have projection maps 


15 : I] A; = A; 
ie] 
for 7 € I defined by 
m;(a) = a9) 
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If J = {1,2,...,n} we also write Ay x --- x Ap. If J is arbitrary but A; = A for all i € J, 
then we usually write A’ for the direct product, and call it a (direct) power of A. A® is a 
trivial algebra. 

A direct product [],_,; A; of sets is often visualized as a rectangle with base J and vertical 
cross sections A;. An element a of [Le , A; is then a curve as indicated in Figure 17. Two 
elementary facts about direct products are stated next. 


Figure 17 


Theorem 7.9. [f A;, Ao, and Ag are of type F then 


(a) Ay x Ap = Ay x Ay under a((ay, a2)) = (a2, a1). 
(b) Ay x (Ag x A3) = Ay x Ag x Ag under a((ay, (a2, a3))) = (a1, a2, a3). 


PROOF. (Exercise.) 


In Chapter IV we will see that there is up to isomorphism only one nontrivial directly in- 
decomposable Boolean algebra, namely a two-element Boolean algebra, hence by cardinality 
considerations it follows that a countably infinite Boolean algebra cannot be isomorphic to 
a direct product of directly indecomposable algebras. On the other hand for finite algebras 
we have the following. 


Theorem 7.10. Every finite algebra is isomorphic to a direct product of directly indecom- 
posable algebras. 


ProoF. Let A be a finite algebra. If A is trivial then A is indecomposable. We proceed by 
induction on the cardinality of A. Suppose A is a nontrivial finite algebra such that for every 
B with |B| < |A| we know that B is isomorphic to a product of indecomposable algebras. If 
A is indecomposable we are finished. If not, then A ~ A, x A» with 1 < |Aj,|,|Ao|. Then, 
|A;|, |A2| < |A|, so by the induction hypothesis, 


A, =B,x.---x Bn, 
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where the B; and C; are indecomposable. Consequently, 


A=>B, x:---x By x Cy x +++ x Cp. 


Using direct products there are two obvious ways (which occur a number of times in 
practice) of combining families of homomorphisms into single homomorphisms. 


Definition 7.11. (i) If we are given maps a; : A — A;,i € I, then the natural map 
a:A-7> I] A; 
i€l 
is defined by 
(aa) (i) = aya. 
(ii) If we are given maps a; : A; > B;, i € I, then the natural map 
Qa: I] A; => ][2 
iel ie] 


is defined by 


Theorem 7.12. (a) Ifa;: A — Aj, i € I, is an indezed family of homomorphisms, then 
the natural map a is a homomorphism from A to A* = [],-; Ai- 

(b) Ifa; : A; > Bi, 7 € I, is an indexed family of homomorphisms, then the natural map 
a is a homomorphism from A* = |],-; Ai to B* = [],-; Bi. 
PROOF. Suppose a; : A — A, is a homomorphism for 7 € J. Then for a,,...,a, € A and 
f © Fn we have, fori € J, 


(af*(a1,...,4n))(4) = a4 f*(a1,...,@n) 


= f*(aja1,...,040n) 
= f*((aa1)(i),..., (aan) (@)) 
— f*"(aa1,...,4n)(4); 
hence 
af*(a1,...,4n) = f* (aa1,...,00n), 


so @ is indeed a homomorphism in (a) above. Case (b) is a consequence of (a) using the 
homomorphisms a; 0 7. 
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Definition 7.13. If a,,a2 € A anda: A— Bis a map we say a separates a; and az if 
aa, # Adg. 


The maps a; : A — A;,i € I, separate points if for each a,,a2 € A with a, ¥ az there is an 
qa; such that 


a;(a1) A a; (a2). 


Lemma 7.14. For an indexed family of maps a; : A — Aj, i € I, the following are 
equivalent: 


(a) The maps a; separate points. 
(b) a is injective (a is the natural map of 7.11(a)). 
(c) (jez ker a; = A. 


PROOF. (a) = (b): Suppose aj, a2 € A and a, # ag. Then for some i, 


a;(a1) 4 ai (a2); 
hence 
(aa1)(t) A (aag) (2) 
aa, # Ady. 


(b) = (c): For a,,a2 € A with a, 4 ay, we have 
aa, # Ad9; 


hence 
(aay) (7) F (ag) () 
for some 7, so 
Ajay, f aja2g 
for some 7, and this implies 


(a1, @2) ¢ ker aj, 


SO 
() ker a; = A. 
ie] 


(c) > (a): For aj, a2 € A with a, F aa, 


(a1, 2) £ (ker on 


we. 
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so, for some 7, 
(a1, @2) ¢ ker a;; 
hence 
ajay, es Q;a2. 


Theorem 7.15. Jf we are given an indexed family of homomorphisms a;: A — Aj, 7 € I, 
then the natural homomorphism a: A — [J,-; Ai ts an embedding iff (je; ker ai = A iff the 
maps a, separate points. 


PRooF. This is immediate from 7.14. 


EXERCISES 87 


1. If 6,6* € Con A show that they form a pair of factor congruences on A iff @N6* = A 
and @o#* =V. 


2. Show that (Con A,) x (Con A2) can be embedded in Con A, x Ag. 
3. Give examples of arbitrarily large directly indecomposable finite distributive lattices. 


4. If Con A is a distributive lattice show that the factor congruences on A form a com- 
plemented sublattice of Con A. 


5. Find two algebras A, A» such that neither can be embedded in A, x Ag. 


88. Subdirect Products, Subdirectly Irreducible 
Algebras, and Simple Algebras 


Although every finite algebra is isomorphic to a direct product of directly indecomposable 
algebras, the same does not hold for infinite algebras in general. For example, we see that a 
denumerable vector space over a finite field cannot be isomorphic to a direct product of one- 
dimensional spaces by merely considering cardinalities. The quest for general building blocks 
in the study of universal algebra led Birkhoff to consider subdirectly irreducible algebras. 


Definition 8.1. An algebra A is a subdirect product of an indexed family (A;);<-7 of algebras 
if 

(i) A< Tey Ai 
and 

(ii) 7;(A) = A, for each i € I. 
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An embedding a: A => [],-,; Ai is subdirect if a(A) is a subdirect product of the Aj. 


i€l 
Note that if J = @ then A is a subdirect product of @ iff A = [] 9, a trivial algebra. 


Lemma 8.2. [f 6; € Con A fori € I and (),-,6: = A, then the natural homomorphism 
y:A—> I] A/6; 
iel 
defined by 
v(a)(t) = a/6; 
is a subdirect embedding. 
Proor. Let v; be the natural homomorphism from A to A/@; for i € I. As kerv; = 4;, 


it follows from 7.15 that v is an embedding. Since each v; is surjective, v is a subdirect 
embedding. 


Definition 8.3. An algebra A is subdirectly irreducible if for every subdirect embedding 


a: A>][A 
ic 
there is an 7 € I such that 
T,0a@:A—> A; 


is an isomorphism. 


The following characterization of subdirectly irreducible algebras is most useful in prac- 
tice. 


Theorem 8.4. An algebra A is subdirectly irreducible iff A is trivial or there is a minimum 
congruence in Con A — {A}. In the latter case the minimum element is (\(Con A — {A}), 
a principal congruence, and the congruence lattice of A looks like the diagram in Figure 18. 


V 


oe (\(Con A - {A}) 


A 
Figure 18 
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ProoFr. (=) If A is not trivial and Con A— {A} has no minimum element then (|(Con A — 
{A}) =A. Let J = Con A — {A}. Then the natural map a: A — [[,,, A/@ is a subdirect 
embedding by Lemma 8.2, and as the natural map A — A/@ is not injective for 0 € J, it 
follows that A is not subdirectly irreducible. 

(<=) If A is trivial and a: A — [J,., A; is a subdirect embedding then each A; is 
trivial; hence each 7; 0 a is an isomorphism. So suppose A is not trivial, and let 9 = 
(\(Con A—{A}) # A. Choose (a,b) € 6, a #b. Ifa: A — [],-; Ai is a subdirect embedding 
then for some i, (aa)(i) 4 (ab)(z); hence (7; 0 a)(a) 4 (m0 a)(b). Thus (a,b) ¢ ker(7; 0 a) 
so 0 £ ker(m; 0 a). But this implies ker(m; 0a) = A, so 7;0 a: A — A; is an isomorphism. 
Consequently A is subdirectly irreducible. 

If Con A — {A} has a minimum element 6 then for a # b and (a,b) € 6 we have 
O(a,b) C 6, hence 6 = O(a, d). 


Using 8.4, we can readily list some subdirectly irreducible algebras. 


EXAMPLES. (1) A finite Abelian group G is subdirectly irreducible iff it is cyclic and |G] = p” 
for some prime p. 
(2) The group Z,~ is subdirectly irreducible. 
(3) Every simple group is subdirectly irreducible. 
(4) A vector space over a field F' is subdirectly irreducible iff it is trivial or one-dimensional. 
(5) Any two-element algebra is subdirectly irreducible. 


A directly indecomposable algebra need not be subdirectly irreducible. For example 
consider a three-element chain as a lattice. But the converse does indeed hold. 


Theorem 8.5. A subdirectly irreducible algebra is directly indecomposable. 


PROOF. Clearly the only factor congruences on a subdirectly irreducible algebra are A and 
V, so by 7.7 such an algebra is directly indecomposable. 


Theorem 8.6 (Birkhoff). Every algebra A is isomorphic to a subdirect product of subdirectly 
irreducible algebras (which are homomorphic images of A). 


ProoF. As trivial algebras are subdirectly irreducible we only need to consider the case of 
nontrivial A. For a,b € A with a ¢ b we can find, using Zorn’s lemma, a congruence 6,5 
on A which is maximal with respect to the property (a,b) ¢ @.,. Then clearly O(a, b) V 
6,» is the smallest congruence in [0,.5, V] — {Aan}, so by 6.20 and 8.4 we see that A/@., 
is subdirectly irreducible. As (){0Qa, : a # b} = A we can apply 8.2 to show that A 
is subdirectly embeddable in the product of the indexed family of subdirectly irreducible 
algebras (A/6.,p)azo- 


An immediate consequence of 8.6 is the following. 
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Corollary 8.7. Every finite algebra is isomorphic to a subdirect product of a finite number 
of subdirectly irreducible finite algebras. 


Although subdirectly irreducible algebras do form the building blocks of algebra, the 
subdirect product construction is so flexible that one is often unable to draw significant 
conclusions for a class of algebras by studying its subdirectly irreducible members. In some 
special yet interesting cases we can derive an improved version of Birkhoff’s theorem which 
permits a much deeper insight—this will be the theme of Chapter IV. 

Next we look at a special kind of subdirectly irreducible algebra. This definition extends 
the usual notion of a simple group or a simple ring to arbitrary algebras. 


Definition 8.8. An algebra A is simple if Con A = {A, V}. A congruence @ on an algebra 
A is maximal if the interval [0, V] of Con A has exactly two elements. 


Many algebraists prefer to require that a simple algebra be nontrivial. For our devel- 
opment, particularly for the material in Chapter IV, we find the discussion smoother by 
admitting trivial algebras. 


Just as the quotient of a group by a normal subgroup is simple and nontrivial iff the 
normal subgroup if maximal, we have a similar result for arbitrary algebras. 


Theorem 8.9. Let 6 € Con A. Then A/O is a simple algebra iff 0 is a maximal congruence 
on A or9@=V. 


PROOF. We know that 
Con A/6 = [6, Va] 


by 6.20, so the theorem is an immediate consequence of 8.8. 


REFERENCE 


1. G. Birkhoff [1944] 


EXERCISES 88 


1. Represent the three-element chain as a subdirect product of subdirectly irreducible 
lattices. 


2. Verify that the examples following 8.4 are indeed subdirectly irreducible algebras. 


3. (Wenzel). Describe all subdirectly irreducible mono-unary algebras. [In particular 
show that they are countable.] 
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10. 


1a 


§9. 
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(Taylor). Let A be the set of functions from w to {0,1}. Define the bi-unary algebra 
(A, f, 9) by letting 


Show that A is subdirectly irreducible. 


(Taylor). Given an infinite cardinal \ show that one can construct a unary algebra A 
by size 2* with \ unary operations such that A is subdirectly irreducible. 


Describe all subdirectly irreducible Abelian groups. 


If S is a subdirectly irreducible semilattice show that |S| < 2. (Use §5 Exercise 9.) 
Hence show that every semilattice is isomorphic to a semilattice of the form (A,N), 
where A is a family of sets closed under finite intersection. 


. A congruence 6 on A is completely meet irreducible if whenever 0 = (),-, 9;,0; € Con A, 


we have 0 = 6;, for some i € J. Show that A/O is subdirectly irreducible iff 0 is 
completely meet irreducible. (Hence, in particular, A is subdirectly irreducible iff A 
is completely meet irreducible.) 


. If H = (A,V,A,—,0,1) is a Heyting algebra and a € H define 6, = {(b,c) € H?: 


(b > c) \(c > b) > a}. Show that 0, is a congruence on H. From this show that H is 
subdirectly irreducible iff |H] = 1 or there is an element e £ 1 such thatb41>b<e 
for b € H. 


Show that the lattice of partitions (II(A), C) of a set A is a simple lattice. 


If A is an algebra and 6; € Con A, 7 € J, let 0 = a: 6;. Show that A/@ can be 
subdirectly embedded in [],-; A/6. 


tel 


Class Operators and Varieties 


A major theme in universal algebra is the study of classes of algebras of the same type closed 
under one or more constructions. 


Definition 9.1. We introduce the following operators mapping classes of algebras to classes 
of algebras (all of the same type): 


A € 1(K) iff A is isomorphic to some member of 
A € S(K) iff A is a subalgebra of some member of 
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A € H(Kk) iff A is a homomorphic image of some member of 
A € P(Kk) iff A is a direct product of a nonempty family of algebras in 
A € Ps(K) iff A is a subdirect product of a nonempty family of algebras in Kk. 


If O; and Oy are two operators on classes of algebras we write O,;O2 for the composition of 
the two operators, and < denotes the usual partial ordering, i.e., O; < O2 if O;(K) C Oo(K) 
for all classes of algebras K. An operator O is idempotent if O? = O. A class K of algebras 
is closed under an operator O if O(K) C Kk. 

Our convention that P and Ps apply only to non-empty indexed families of algebras is the 
convention followed by model theorists. Thus for any operator O above, O(@) = @. Many 
algebraists prefer to include | [ @, guaranteeing that P(A’) and Ps(/’) always contain a trivial 
algebra. However this leads to problems formulating certain preservation theorems—see V§2. 
For us [| @ is really used only in IV81, §5 and §7. 


Lemma 9.2. The following inequalities hold: SH < HS, PS < SP, and PH < HP. Also 
the operators, H,S, and IP are idempotent. 


PROOF. Suppose A = SH(Kk). Then for some B € K and onto homomorphism a: B — C, 
we have A < C. Thus a~!(A) < B, and as a(a7!(A)) = A, we have A € HS(K). 

If A € PS(K) then A = JJ,., A; for suitable A; < B; ¢ K, ie J. As[J,-, Ai <I] 
we have A € SP(K). 

Next if A € PH(IK), then there are algebras B; € K and epimorphisms a; : B; — A; 
such that A = [],., Aj. It is easy to check that the mapping a : [],., B; — [],-; Ai defined 
by a(b)(i) = a;(b(z)) is an epimorphism; hence A € HP(K). 

Finally it is a routine exercise to verify that H? = H, etc. 


ie] iel Bi, 


Definition 9.3. A nonempty class K of algebras of type F is called a variety if it is closed 
under subalgebras, homomorphic images, and direct products. 


As the intersection of a class of varieties of type F is again a variety, and as all algebras 
of type F form a variety, we can conclude that for every class K of algebras of the same type 
there is a smallest variety containing K. 


Definition 9.4. If K is a class of algebras of the same type let V(/) denote the smallest 
variety containing K. We say that V(K) is the variety generated by K. If Kk has a single 
member A we write simply V(A). A variety V is finitely generated if V = V(K) for some 
finite set K of finite algebras. 


Theorem 9.5 (Tarski). V = HSP. 


PROOF. Since HV = SV =IPV =V and I < V it follows that HSP < HS PV = V. From 
Lemma 9.2 we see that H(HSP) = HSP,S(HSP) < HSSP = HSP, and P(HSP) < 
APSP < HSPP < HSIPIP = HSIP < HSHP < HHSP = HSP; hence for any 
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K, HSP(K) is closed under H, S, and P. As V(ix) is the smallest class containing AK and 
closed under H, S, and P, we must have V = HSP. 


Another description of the operator V will be given at the end of 811. The following 
version of Birkhoff’s Theorem 8.6 is useful in studying varieties. 


Theorem 9.6. Jf K is a variety, then every member of K is isomorphic to a subdirect 
product of subdirectly irreducible members of K. 


Corollary 9.7. A variety is determined by its subdirectly irreducible members. 
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EXERCISES 89 
1. Show that [SP(K) is the smallest class containing A and closed under J, S, and P. 
2. Show HS 2 SH, HP AIPA, ISPZIPS. 
3. Show ISPHS £ISHPS 4 IASP. 


4. (Pigozzi). Show that there are 18 distinct class operators of the form [O,---O, where 
O; € 44,5, P} for 154 < 1. 


5. Show that if V has the CEP (see §5 Exercise 10) then for K CV, HS(k) =SH(K). 


810. Terms, Term Algebras, and Free Algebras 


Given an algebra A there are usually many functions besides the fundamental operations 
which are compatible with the congruences on A and which “preserve” subalgebras of A. The 
most obvious functions of this type are those obtained by compositions of the fundamental 
operations. This leads us to the study of terms. 


Definition 10.1. Let X be a set of (distinct) objects called variables. Let F be a type of 
algebras. The set T(X) of terms of type ¥ over X is the smallest set such that 


Gy Use 700, 
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(ii) If pi,...,pn € T(X) and f € F, then the “string” f(p1,...,pn) € T(X). 


For a binary function symbol - we usually prefer p;- po to -(p1, po). For p € T(X) we often 
write p as p(21,...,£n) to indicate that the variables occurring in p are among %1,...,Xn. A 
term p is n-ary if the number of variables appearing explicitly in p is < n. 


EXAMPLES. (1) Let F consist of a single binary function symbol -, and let X = {z, y, z}. 
Then 
TY, 2,0 °Y,Y 2, 20° (y- 2), and (x -y) °&z 
are some of the terms over X. 
(2) Let F consist of two binary operation symbols + and -, and let X be as before. Then 


Dye eyes) and (ey) (Ez) 


are some of the terms over X. 

(3) The classical polynomials over the field of real numbers R are really the terms as 
defined above of type F consisting of +,-, and — together with a nullary function symbol r 
for each r € R. 


In elementary algebra one often thinks of an n-ary polynomial over R as a function from 
R” to R for some n. This can be applied to terms as well. 


Definition 10.2. Given a term p(x1,..., 2») of type F over some set X and given an algebra 
A of type F we define a mapping p“ : A” — A as follows: 
(1) if p is a variable z;, then 


ig C Coee ae 
for a1,...,@n € A, ie., p* is the ith projection map; 
(2) if p is of the form f(pi(%1,..-,2%n),---,Pk(©1,---;2n)), where f € F,, then 
PP Cis se VT De las one a ee Ae Oa eG, 8 


In particular if p = f € F then p“ = fA. p“ is the term function on A corresponding to the 
term p. (Often we will drop the superscript A). 


The next theorem gives some useful properties of term functions, namely they behave 
like fundamental operations insofar as congruences and homomorphisms are concerned, and 
they can be used to describe the closure operator Sg of 83 in a most efficient manner. 


Theorem 10.3. For any type F and algebras A, B of type F we have the following. 
(a) Let p be an n-ary term of type F, let @ € Con A, and suppose (a;,b;) € 0 forl <i<n. 
Then 
p* (a1, ...,4n)0p™(b1,..., dn). 
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(b) If p is an n-ary term of type F¥ anda: A — B is a homomorphism, then 
ap*(a1,...,4n) = p?(aa1,..., An) 


for aj,...,4n € A. 
(c) Let S be a subset of A. Then 


Se(S) = {p*(a1,...,@n) : p is an n-ary term of type F, n<w, and ay,...,an € S}. 


PROOF. Given a term p define the length I(p) of p to be the number of occurences of n-ary 
operation symbols in p for n > 1. Note that I(p) = 0 iff p€ X UF. 
(a) We proceed by induction on I(p). If [(p) = 0, then either p = x; for some 7, whence 


(p* (a1, Hoe Gm), p*(b1, wae »bn)) (ai, bi) E@ 


or p = a for some a € Fo, whence 
(DB (Giy xu O Sp Digsens OR = (a a) SG: 


Now suppose [(p) > 0 and the assertion holds for every term q with I(q) < l(p). Then we 
know p is of the form 


CCCs rere 8 remem #78 se, Remmrneseea 
and as I(p;) < l(p) we must have, for 1 <i<k, 
(pi(a1,--+54n), PP (b1,---5bn)) € 8; 


hence 


Cl Oi (aay 3 a) ae iy) On OG nea edn) = 0, 


and consequently 
(p™ (G4). ++; n), DB (bi, +--,bn)) € 0. 


(b) The proof of this is an induction argument on I(p). 
(c) Referring to §3 one can give an induction proof, for k > 1, of 


E*(S) = {p*(a1,...,@n) : p is an n-ary term, I(p) <k, n<w, a,...,an € St, 
and thus 


Sg(S) = U E*(S) = {p*(a,,..., an): p is an n-ary term, n <w,44,...,4m € S}. 
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One can, in a natural way, transform the set 7X) into an algebra. 
Definition 10.4. Given F and X, if T(X) # @ then the term algebra of type F over X, 


written T(X), has as its universe the set T(X), and the fundamental operations satisfy 


ime : DivorsDa) Pe E( Digs esPr) 


for f € F, and p; € T(X), 1<i<n. (T(Q) exists iff Fp  G.) 


Note that T(X) is indeed generated by X. Term algebras provide us with the simplest 
examples of algebras with the universal mapping property. 


Definition 10.5. Let K be a class of algebras of type F and let U(X) be an algebra of type 
F which is generated by X. If for every A € K and for every map 


a:X oA 


there is a homomorphism 
B:U(X)-A 
which extends a (i.e., G(x) = a(x) for « € X), then we say U(X) has the universal mapping 


property for K over X, X is called a set of free generators of U(X), and U(X) is said to be 
freely generated by X. 


Lemma 10.6. Suppose U(X) has the universal mapping property for kK over X. Then if 
we are given A € K anda: X — A, there is a unique extension 3 of a such that GB is a 
homomorphism from U(X) to A. 


PRooF. This follows simply from noting that a homomorphism is completely determined 
by how it maps a set of generators (see 6.2) from the domain. 


The next result says that for a given cardinal m there is, up to isomorphism, at most 
one algebra in a class K which has the universal mapping property for K over a set of free 
generators of size m. 


Theorem 10.7. Suppose U;(X,) and U2(X2) are two algebras in a class K with the univer- 
sal mapping property for K over the indicated sets. If |X | = |X|, then U,(X1) = U2(X2). 


PROOF. First note that the identity map 
te Ne pee. 


has as its unique extension to a homomorphism from U,(X,;) to U;(X;) the identity map. 
Now let 
Qa: Xy =a Xo 
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be a bijection. Then we have a homomorphism 

GB: Ui(X1) > U2(X2) 
extending a, and a homomorphism 

y : U2(X2) = U1(X1) 


extending a~'. As 307 is an endomorphism of U2(X2) extending 22, it follows by 10.6 that 
Gov is the identity map on U2(X2). Likewise 70 ( is the identity map on U;(X,). Thus 3 
is a bijection, so U;(X,) = U2(X2). 


Theorem 10.8. For any type F¥ and set X of variables, where X # @ if Fo = @, the term 
algebra T(X) has the universal mapping property for the class of all algebras of type ¥ over 
Xx, 


PRooF. Let a: X — A where A is of type F. Define 
B:T(X)—-A 
recursively by 
Cr= az 


for x € X, and 
BO ipsa Pn) =F pigse se Pn) 


for p1,..-,;Pn € T(X) and f € F,. Then @(p(a1,...,%n)) = p*(az1,...,aX,), and @ is the 
desired homomorphism extending a. 


Thus given any class K of algebras the term algebras provide algebras which have the 
universal mapping property for K. To study properties of classes of algebras we often try to 
find special kinds of algebras in these classes which yield the desired information. Directly 
indecomposable and subdirectly irreducible algebras are two examples which we have already 
encountered. In order to find algebras with the universal mapping property for K which 
give more insight into K we will introduce K-free algebras. Unfortunately not every class kK” 
contains algebras with the universal mapping property for kK. Nonetheless we will be able to 
show that any class closed under J, S, and P contains its K-free algebras. There is reasonable 
difficulty in providing transparent descriptions of K-free algebras for most kK. However, most 
of the applications of K-free algebras come directly from the universal mapping property, 
the fact that they exist in varieties, and their relation to identities holding in K (which 
we will examine in the next section). A proper understanding of free algebras is essential 
in our development of universal algebra—we use them to show varieties are the same as 
classes defined by equations (Birkhoff), to give useful characterizations (Mal’cev conditions) 
of important properties of varieties, and to show every nontrivial variety contains a nontrivial 
simple algebra (Magari). 


§10. Terms, Term Algebras, and Free Algebras 73 


Definition 10.9. Let K be a family of algebras of type F. Given a set X of variables define 
the congruence 0, (X) on T(X) by 


Ox(X) =( }&x(X), 
where 
Ox (X) = {¢ € Con T(X) : T(X)/¢ € IS(K)}; 
and then define F(X), the K-free algebra over X, by 


F(X) = T(X)/Ox(X), 


where 

X = X/On(X). 
For x € X we write Z for x/0x(X), and for p = p(%,...,2n) € T(X) we write Dp for 
pe«(4)(z,,...,E,). If X is finite, say X= {1,...,%,}, we often write Fx(%,...,%,) for 


F(X). F(X) is the universe of Fx(X). 


Remarks. 


(1) Fx(X) exists iff T(X) exists iff X 4 @ or Fo ¥ @. (2) If Fx(X) exists, then X is a set of 
generators of Fx(X) as X generates T(X). (3) If Fp 4 S, then the algebra Fx(@) is often 
referred to as an initial object by category theorists and computer scientists. (4) If kK = @ or 
K consists solely of trivial algebras, then Fx (X) is a trivial algebra as 0x(X) = V. (5) If K 
has a nontrivial algebra A and T(X) exists, then X N(x/0x(X)) = {x} as distinct members 
x,y of X can be separated by some homomorphism a : T(X) — A. In this case |X| = |X]. 
(6) If |X| = |Y| and T(X) exists, then clearly Fx(X) & Fx(Y) under an isomorphism which 
maps X to Y as T(X) & T(Y) under an isomorphism mapping X to Y. Thus Fx(X) is 
determined, up to isomorphism, by K and |X]. 


Theorem 10.10 (Birkhoff). Suppose T(X) exists. Then F(X) has the universal mapping 
property for K over X. 


Proor. Given A € K let a be a map from X to A. Let v : T(X) — F(X) be the 
natural homomorphism. Then aov maps X into A, so by the universal mapping property 
of T(X) there is a homomorphism ps: T(X) — A extending aov} x . From the definition 
of 0x (X) it is clear that 0x(X) C ker ps (as ker p € ®x(X)). Thus there is a homomorphism 


B: F(X) — A such that up = Gov (see §6 Exercise 6) as kery = 6x(X). But then, for 
rEX, 


B(@) = Bov(x) 
= (2) 
=aov(z) 


= a(Z), 
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so 3 extends a. Thus Fx(X) has the universal mapping property for K over X. 


If Fx(X) € K then it is, up to isomorphism, the unique algebra in K with the universal 
mapping property freely generated by a set of generators of size |X|. Actually every algebra 
in K with the universal mapping property for K is isomorphic to a K-free algebra (see 
Exercise 6). 


EXAMPLES. (1) It is clear that T(X) is isomorphic to the free algebra with respect to the 
class K of all algebras of type F over X since 0x(X) = A. The corresponding free algebra 
is sometimes called the absolutely free algebra F(X) of type F. 

(2) Given X let X* be the set of finite strings of elements of X, including the empty 
string. We can construct a monoid (X%*,-,1) by defining - to be concatenation, and 1 is the 
empty string. By checking the universal mapping property one sees that (X*,-,1) is, up to 
isomorphism, the free monoid freely generated by X. 


Corollary 10.11. Jf K is a class of algebras of type F and A € K, then for sufficiently 


large X, A € H(FK(X)). 


PROOF. Choose |X| > |A| and let _ 
a:X oA 


be a surjection. Then let a 
B : F(X) — A 


be a homomorphism extending a. 


In general Fx(X) is not isomorphic to a member of K (for example, let K = {L} where 


L is a two-element lattice; then Fx(Z,y) ¢ I(K)). However Fx(X) can be embedded in a 
product of members of K. 


Theorem 10.12 (Birkhoff). Suppose T(X) exists. Then for K 4 @, Fx(X) € ISP(K). 


Thus if K is closed under I, S, and P, in particular if Kk is a variety, then Fx(X) € K. 


PrRoor. As 
On(X) = ()&x(X) 


it follows (see §8 Exercise 11) that 
F(X) = T(X)/0«(X) € IPs({T(X)/8: 6 € ®x(X)}), 


SO 
F(X) € IPsIS(K), 
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and thus by 9.2 and the fact that Ps < SP, 


Fx(X) € ISP(K). 


From an earlier theorem of Birkhoff we know that if a variety has a nontrivial algebra in 
it then it must have a nontrivial subdirectly irreducible algebra in it. The next result shows 
that such a variety must also contain a nontrivial simple algebra. 


Theorem 10.13 (Magari). Jf we are given a variety V with a nontrivial member, then V 
contains a nontrivial simple algebra. 


Proor. Let X = {x,y}, and let 


S = {p(z):peT({x})}, 


a subset of F(X). First suppose that O(S) # V in Con Fy(X). Then by Zorn’s lemma 
there is a maximal element in [O(S), V] — {V}. (The key observation for this step is that for 
6 € (0(S), V], 

6=V iff (2,9) € 80. 


To see this note that if (7,7) € 6 and O(S) C 6, then for any term p(z,y), with F = Fy(X) 
we have 

p'(&, W)op* (F, B)O(S)m; 
hence @ = V.) Let 6) be a maximal element in [O(.S), V] — {V}. Then Fy(X)/0o is a simple 
algebra by 8.9, and it is in V. 

If, however, O(.S') = V, then since O is an algebraic closure operator by 5.5, it follows 
that for some finite subset Sg of S we must have (Z,7¥) € O(S_). Let S be the subalgebra 
of Fy(X) with universe S (note that S = Sg({z}) by 10.3(c)). As V is nontrivial we must 
have Z # y in Fy(X), and as (7,7) € O(S) it follows that S is nontrivial. Now we claim 
that Vs = O(So), where 9 in this case is understood to be the appropriate closure operator 
on S. To see this let p(%) € S and let 


a:Fy(X)-S 
be the homomorphism defined by 
Ot) =z 
a(y) = p(Z) 


As 
(Z,9) € O(So) in Fy(X), 
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it follows from 6.6 (see §6 Exercise 5) that 
(F, p(Z)) € O(So) in $ 


as 
a(So) = So. 


This establishes our claim; hence using Zorn’s lemma we can find a maximal congruence 0 
on S as Vg is finitely generated. Hence S/0 is a simple algebra in V. 


Let us turn to another application of free algebras. 


Definition 10.14. An algebra A is locally finite if every finitely generated subalgebra (see 
83.4) is finite. A class K of algebras is locally finite if every member of K is locally finite. 


Theorem 10.15. A variety V is locally finite iff 


|X| <w => |Fy(X)| <w. 


Proor. The direction (=) is clear as X generates Fy(X). For (=) let A be a finitely 
generated member of V, and let B C A be a finite set of generators. Choose X such that we 
have a bijection 

a: X —B. 


Extend this to a homomorphism _ 


As 3 (Fy(X)) is a subalgebra of A containing B, it must equal A. Thus ( is surjective, and 
as F(X) is finite so is A. 


Theorem 10.16. Let K be a finite set of finite algebras. Then V(K) is a locally finite 
variety. 


ProoF. First verify that P(J‘) is locally finite. To do this define an equivalence relation 
~ on T({x1,...,%}) by p ~ q if the term functions corresponding to p and q are the 
same for each member of Kk. Use the finiteness conditions to show that ~ has finitely many 
equivalence classes. This, combined with 10.3(c), suffices. Then it easily follows that V is 
locally finite since every finitely generated member of HS P(K) is a homomorphic image of 
a finitely generated member of S'P(Ix). 
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EXERCISES $10 


ale 


Let L be the four-element lattice ({0,a,b,1},V,A) where 0 is the least element, 1 is 
the largest element, and a\b=0, aVb= 1 (the Hasse diagram is Figure 1(c)). Show 
that L has the universal mapping property for the class of lattices over the set {a, b}. 


Let A = (w, f) be the mono-unary algebra with f(n) = n+1. Show A has the universal 
mapping property for the class of mono-unary algebras over the set {0}. 


Let p be a prime number, and let Z, be the set of integers modulo p. Let Z, be 
the mono-unary algebra (Z,, f) defined by f(7%) = n+ 1. Show Z, has the universal 
mapping property for K over {1}, where K is the class of mono-unary algebras (A, f) 
satisfying f?(x) & x. 


. Show that the group Z = (Z,+,—,0) of integers has the universal mapping property 


for the class of groups over {1}. 


. If V is a variety and |X| < |Y| show Fy(X) can be embedded in Fy(Y) in a natural 


way. 


. If U(X) € K and U(X) has the universal mapping property for kK over X show that 


U(X) & Fx(X) under a mapping a such that a(x) =F. 


. Show that for any algebra A and a,b € A, O((a, b)) = t*(s({ (p(a, 0), p(b, ©) : p(x, ys, 


.-;Yn) isaterm, c1,...,Cn € A}))UA4, where t*( _) is the transitive closure operator, 
ie., for Y C Ax A, t*(Y) is the smallest subset of A x A containing Y and closed 
under t. (See the proof of 5.5.) 
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One of the most celebrated theorems of Birkhoff says that the classes of algebras defined 
by identities are precisely those which are closed under H, S, and P. In this section we 
study identities, their relation to free algebras, and then give several applications, including 
Birkhoff’s theorem. We have already seen particular examples of identities, among which are 
the commutative law, the associative law, and the distributive laws. Now let us formalize 
the general notion of an identity, and the notion of an identity holding in an algebra A, or 
in a class of algebras K. 


Definition 11.1 An identity of type F over X is an expression of the form 


pS 
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where p,q € T(X). Let Id(X) be the set of identities of type ¥ over X. An algebra A of type 
F satisfies an identity 
iS] Cena 2) eas or Deere Fe) 
(or the identity is true in A, or holds in A), abbreviated by 
A I p(t, tee sa) x q(“1, oo ns 


or more briefly 


AFpeq 
if for every choice of aj,...,@, € A we have 
p* (a1, eee q* (a1, eee blie 


A class K of algebras satisfies p & q, written 


KD 4q, 
if each member of K satisfies p = q. If % is a set of identities, we say K satisfies 1, written 
Gi ose 
if K -: pq for each pq € &. Given K and X let 


Idk (X) ={peqeId(X):K Ep gq}. 


We use the symbol £ for “does not satisfy.” 


We can reformulate the above definition of satisfaction using the notion of homomor- 
phism. 


Lemma 11.2. Jf K is a class of algebras of type F and p & q 1s an identity of type F over 
X, then 
KEpeq 


iff for every A € K and for every homomorphism a: T(X) > A we have 


ap = aq. 


PRooF. (=) Let p = p(a1,...,2n), d = Q(41,---,;2n). Suppose K F pxq, A € K, and 
a: T(X) — A is a homomorphism. Then 


p*(az,...,02,) = q*(az1,..., Ln) 
=> ap™*®)(x1,...,£,) = aq™™) (x1,..., an) 


=> ap = aq. 
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(<) For the converse choose A € K and aj,...,d, € A. By the universal mapping 
property of T(X) there is a homomorphism a: T(X) — A such that 


AX; = ai, Ll<i<n. 
But then 
p*(a1,-..,@n) = p*(ar1,..., 2p) 
= q*(ax1, aaa.) 
= oF (ais oS lps) 
sok Eps q. 


Next we see that the basic class operators preserve identities. 


Lemma 11.3. For any class K of type F all of the classes Kk, I(x), SUX), H(kK), PU) 
and V(K) satisfy the same identities over any set of variables X. 


PROOF. Clearly kK and I(K) satisfy the same identities. As 
Peis. «Fh Mid: 72g 
we must have 
Idx (X) D Idscxy(X), Iduxy(X), and — Idpyxy(X). 


For the remainder of the proof suppose 


LO Dl Vi saay ny EO Vig seas ta 


Then if B< AE K and }j,...,6, € B, then as b;,...,5, € A we have 


hence 
pP (bi, » bn) =q>(b1,.--, bn), 
SO 
BEpRrq 
Thus 
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Next suppose a: A — B is a surjective homomorphism with A € K. If b;,...,b, € B, 
choose aj,...,@, € A such that 


a(a,) = bi, Sipe) Oda) Uns 
Then 
pr’ (@1,---,@n) =" (@1,---, Gn) 
implies 
ap’ (a1,.--,4n) = ag” (a1,---, An); 
hence 
pe (bi, »bn) =" (b1,---, bn) 
Thus 
BrFpeg, 
so 


Idx«(X) = Idayny(X). 
Lastly, suppose A; € K for 7 ¢ J. Then for a1,...,@n € A= |],<; Ai we have 


p*(ai(i), Sane ; An (t)) = q*(ai(i), ei: an(%)); 


hence 


for 2 € I, so 
dul (cs een oe ear Cs ee eB 
Thus 
Idx (X) = Idpxy(X). 
As V = HSP by 9.5, the proof is complete. 


Now we will formulate the crucial connection between K-free algebras and identities. 


Theorem 11.4. Given a class K of algebras of type F¥ and terms p,q € T(X) of type F we 
have 


@D=G in Fx(X) 
= (p,q) € Ox(X). 
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PRoor. Let F = Fx(X), p= p(r1,...,2n), d= 9(21,---,2n), and let 


y:T(X)-F 


be the natural homomorphism. Certainly K — p = q implies FE pxqasF € ISP(K). 
Suppose next that F — p = q. Then 


p’ (Zi, 25 pia) <2 q(T, oat 5 Leni 


hence p = q. Now suppose p = 7 in F. Then 


sO 
(p,q) € kerv = 0x(X). 


Finally suppose (p,q) € @x(X). Given A € K and ay,...,a, € A choosea: T(X) > A 
such that ax; = aj, 1 <i<n. As kera € ®x(X) we have 
ker a D kery = 6x(X), 


so it follows that there is a homomorphism 3: F — A such that a = (ov (see §6 Exercise 
6). Then 


a(p) = Bov(p) = Bov(q) =a(q). 
Consequently 
KEpeq 


by 11.2. 


Corollary 11.5. Let K be a class of algebras of type F, and suppose p,q € T(X). Then for 
any set of variables Y with |Y| > |X| we have 


KEpeq iff Fx(Y) Epeg. 


ProoF. The direction (=) is obvious as Fx (Y) € JSP(K). For the converse choose Xy D X 
such that |Xo| = |Y|. Then 
Fx (Xo) = Fx(Y), 
and as 
KEpeq iff Fe(Xo) Epeq 


by 11.4 it follows that 
KEpreq iff Fe(Y) Epe¢@ 
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Corollary 11.6. Suppose K is a class of algebras of type F and X is a set of variables. 
Then for any infinite set of variables Y, 


Proor. For p = q € Idx(X), say p = p(%,...,2%n), ¢ = G(£1,---,;2n), we have p,q € 
PAG s5 35 AS. Te el <I | by: TES 


KEpeq iff Fx(Y) Rpg, 


so the corollary is proved. 


As we have seen in $1, many of the most popular classes of algebras are defined by 
identities. 


Definition 11.7. Let © be a set of identities of type F, and define /(%) to be the class 
of algebras A satisfying ©. A class K of algebras is an equational class if there is a set of 
identities © such that K = M(%). In this case we say that K is defined, or axiomatized, by 
3 


Lemma 11.8. If V is a variety and X is an infinite set of variables, then V = M(Idy(X)). 


PROOF. Let 
Vi= M(Idy(X)). 


Clearly V’ is a variety by 11.3, V’ D V, and 
Idy:(X) = Idy(X). 


So by 11.4, 


Fy (X) = Fy(X). 


Now given any infinite set of variables Y, we have by 11.6 


Idy(Y) = Idp x (Y) = Idp, xy (Y) = Idy(Y). 
Thus again by 11.4, 
Oy(Y) = Ov(Y); 


hence 


Fy:(Y) = Fy(Y). 
Now for A € V’ we have (by 10.11), for suitable infinite Y, 


A € H(Fy(Y)); 
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hence 
A € H(F\(Y)), 


so A € V; hence V’ C V, and thus V’ = V. 


Now we have all the background needed to prove the famous theorem of Birkhoff. 
Theorem 11.9 (Birkhoff). kK is an equational class iff K is a variety. 


PROOF. (=) Suppose 


Kk = M(x) 
Then 
VA) ED 
by 11.3; hence 
V(K) C M(X), 
SO 
V(K)=K, 


i.e., K is a variety. 
(<=) This follows from 11.8. 


We can also use 11.4 to obtain a significant strengthening of 10.12. 


Corollary 11.10. Let K be a class of algebras of type F. If T(X) exists and K' is any class 
of algebras such that K C K' CV(K), then 


F(X) = F(X). 


In particular it follows that 


PROOF. Since Idg(X) = Idy; ae ) by 11.3, it follows that Idg(X) = Idx(X). Thus 
big (X) = O«(X), so Fx:(X) = Fx(X). The last statement of the corollary then follows 
from 10.12. 


So far we know that K-free algebras belong to 1S P(k). The next result partially sharpens 
this by showing that large K-free algebras are in I Ps(K). 


Theorem 11.11. Let K be a nonempty class of algebras of type ¥. Then for some cardinal 
m, if |X| >m we have 
Fx(X) € IPs(Kk). 
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ProoF. First choose a subset K* of K such that for any X, Idq«(X) = Id«(X). (One can 
find such a K* by choosing an infinite set of variables Y and then selecting, for each identity 
p= qin Id(Y) — Idx(Y), an algebra A € K such that A | p & q.) Let m be any infinite 
upper bound of {|A|: A € A*}. (Since A* is a set such a cardinal m must exist.) 

Next let Ux+(X), for any X, be {@ € Con T(X) : T(X)/¢ € I(K*)}. Then Ux«(X) C 
Ox«(X), hence (} Ux«(X) D Ox«(X). To prove equality of these two congruences for |X| > m 
suppose (p,q) ¢ Ox+(X). Then K* |- p = q by 11.4; hence for some A € K*, AKA pq. 
If p = p(x1,...,2n), ¢ = G(21,...,2n), Choose a1,...,@, € A such that p*(aj,...,an) # 
q*(a1,-..,;@n). As |X| > |A] we can find a mapping a : X — A which is onto and ax; = 
a;, 1 <i <n. Then a can be extended to a surjective homomorphism (3 : Fx-(X) — A, 
and G(p) # B(q). Thus (p,q) ¢ ker8 € Ux-(X), so (p,q) ¢ (\VUx+(X). Consequently 
() Ux+(X) = On+(X). AS F(X) = Fx+(X) by 11.4, it follows that Fx(X) = T(X)/() Vx«(X). 


Then (see §8 Exercise 11) we see that Fx(X) € IPs(K*) C IPs(K). 


Theorem 11.12. V = HPs. 


PROoF. As 
Pe SP 


we have 


HPs < HSP=V. 


Given a class K of algebras and sufficiently large X, we have 
Fycq)(X) € IPs(K) 


by 11.11; hence 
V(K) C APs(k) 


by 10.11. Thus 
Y= APs. 


REFERENCE 


1. G. Birkhoff [1935] 


EXERCISES §11 


1. Given a type F and a set of variables X and p,q € T(X) show that T(X) Ep & q iff 
p=q (thus T(X) does not satisfy any interesting identities). 


2. If V is a variety and X is infinite, show V = HSP(Fy(X)). 


§12. Mal’cev Conditions 85 


3. If X is finite and Idy(X) defines V does it follow that V = HSP(Fy(X))? 
4. Describe free semilattices. 


5. Show that if V = V(A) then, given X 4 2, Fy(X) can be embedded in All". In 
particular if A has no proper subalgebras the embedding is also subdirect. 


§12. Mal’cev Conditions 


One of the most fruitful directions of research was initiated by Mal’cev in the 1950’s when 
he showed the connection between permutability of congruences for all algebras in a variety 
V and the existence of a ternary term p such that V satisfies certain identities involving p. 
The characterization of properties in varieties by the existence of certain terms involved in 
certain identities we will refer to as Mal’cev conditions. This topic has been significantly 
advanced in recent years by Taylor. 


Lemma 12.1. Let V be a variety of type F, and let 


D Gigexss teas YAS ens 
Geiss Ce ige eos Uh) 


be terms such that in F = Fy(X), where 
Sa ieee Dit its 
we have 
UO" (iyi ss Dyn Ye Os (Eas arg Dae Uieeas Ua) oO Unk Un 


Then 


V P= DBs ot Bins Uy at) PS Oh isna ten Uy ass): 


PRooF. The homomorphism 
Qa. Fy(71,. : -5Lm, Yi) - . U5) a Fy (71,. : fDi) 


defined by 


and 
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is such that 
OG i+: +4 F,) CG kere: 
sO 
3) 6) Ce TIER Pe TEM een 17 Clea meme eT RR a 
thus 
TOT sees Meet ste, Uy a epee Oe) 
in Fy(@1,...,;%m,¥), so by 11.4 


V Bi sod eral Ads ah) POG Misc Dias Uy ds 4) 


Theorem 12.2 (Mal’cev). Let V be a variety of type F¥. The variety V is congruence- 
permutable iff there is a term p(x, y,z) such that 


VE p(a,2,y) &y 
and 


VE p(2,y,y) & &. 


Proor. (=) If V is congruence-permutable, then in Fy (%, 7, Z) we have 
7) € 0,7) 00.2) 
se) 
(EZ) © OG, 2) 0 Osa): 
Hence there is a p(%, 9, Z) € Fv (%, 7%, Z) such that 
7O(9,7)p(F.9,2)0 (E07 


By 12.1 
VEp(a,y,y) ® x 


and 
VF pa, 2, 2) © z. 


(<=) Let A € V and suppose ¢, 7 € Con A. If 
(a,b) € dow, 


say adcwb, then 
b = p(c,c, b) pla, ¢, b) pla, b, b) = a, 
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so 
(b,a) € poy. 
Thus by 5.9 
pop= vod. 


EXAMPLES. (1) Groups (A,-,~1, 1) are congruence-permutable, for let p(x, y, z) be x-y~!+z. 


(2) Rings (R,+,-,—,0) are congruence-permutable, for let p(x, y, z) be x —y + z. 
(3) Quasigroups (Q,/,-,\) are congruence-permutable, for let p(x,y,z) be (x/(y\y)) - 
(y\2). 


Theorem 12.3. Suppose V is a variety for which there is a ternary term M(x, y,z) such 
that 


Then V is congruence-distributive. 
PROOF. Let ¢,w~,x € Con A, where A € V. If 


(a,b) €@A (WV Xx) 


then (a,b) € @ and there exist ci,...,¢, such that 


APC XC2 +++ WCnxd. 


But then as 
M(a, on b)oM(a, Ci; a) =a, 


for each 2, we have 
a= M(a, a, b)\(o \ v)M(a, C1, b)(o \ x)M(a, C2, b) a M(a, Cn) b)(o \ x)M (a, b, b) = b, 


0) 
(a,b) € (PAW) V (PAX). 
This suffices to show 


ON(WVX) = (PAY) V(GAX), 


so V is congruence-distributive. 


EXAMPLE. Lattices are congruence-distributive, for let 


M(a,y,z2) =(a@Vy)A(av2z)A(yVz). 
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Definition 12.4. A variety V is arithmetical if it is both congruence-distributive and 


congruence-permutable. 


Theorem 12.5 (Pixley). 


conditions 


A variety V is arithmetical iff it satisfies either of the equivalent 


(a) There are terms p and M as in 12.2 and 12.3. 
(b) There is a term m(a, y, z) such that 


V 


E m(x,y, 2) & m(z,y,y) & my, y, 2) & a. 


ProoF. If V is arithmetical then there is a term p as V is congruence-permutable. Let 
F(Z, 7, Z) be the free algebra in V freely generated by {%, 7, Z}. Then as 


it follows that 


(, 2) € O(%, 2) N[O@, 9) V OG, 2) 


(T, 2) € [O@,z)N O(7,9)] V [O@, z) NOG, Z)]; 


hence 


(Z,2) € [O(Z,z)N O(%,y)] 0 (OG, z) N OG, Z)]. 
Choose M(Z,¥,Z) € Fv(%,Y,Z) such that 


Z|O(Z, Z) N O(%, 9) |M (Z, y, Z)[O(Z, Z) N O(Y, Z) Iz. 


Then by 12.1, 


VE M(a,2,y) = M(x,y,2) © M(y, 2,2) © «. 
If (a) holds then let m(z,y,z) be p(x, M(2z,y,z),z). Finally if (b) holds let p(x,y,z) be 


m(x,y,z) and let M(x, y, z) be m(x,m(z, y, z), z), and use 12.2 and 12.3. 


EXAMPLES. (1) Boolean algebras are arithmetical, for let 


m(S,9,2) = (NZ V (Ay AZIVG Ay he). 


(2) Heyting algebras are arithmetical, for let 


m(x,y,2) =[(e > y) > A [(z 9) a] A lev 2) 


Note that 12.3 is not a Mal’cev condition as it is an implication rather than a character- 
ization. Jonsson discovered a Mal’cev condition for congruence-distributive varieties which 
we will make considerable use of in the last chapter. 
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Theorem 12.6 (Jénsson). A variety V is congruence-distributive iff there is a finite n and 
terms po(X,Y,Z),---;Pn(@,y,z) such that V satisfies 


PAL, Ysa) Ls 0<i<n 
Pol, y, 2) & x, Dale. 2) Se 

pile, t, y) x pals; z, y) fori even 
pilx,Yy,Y) © pisi(2, y, y) for i. odd. 


PROOF. (=) Since 
O(Z, z) A (OF, 9) V OW, Z)] = [O@, Zz) A 0%, y)] V [O(@, Zz) A OG, 2) 


in F(Z, 7, Z) we must have 


and from these the desired equations fall out. 
(<=) For ¢,w,v € Con A, where A € V, we need to show 


ONWVX) E(PAY)V(GAX), 
so let 
(a,b) EdA (WV x). 
Then (a,b) € @, and for some cj,...,c; we have 
aweyx ...cexb. 


From this follows, for 0 <i <n, 


pila, a, b)wp;(a, c1,b)x ... pila, ce, b)xpi(a, 6, b); 


hence 
pila, a, b) (o /\ w)pila, C1, b) (¢ /\ x) ps pila, Ct, b) (o /\ x) pila, b, b), 
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pila, a, b) (e /\ v) Vv (o /\ x)|pi(a, b, b), 


0 <i<n. Then in view of the given equations, a|(¢ A w) V (¢ A x)]b, so V is congruence- 
distributive. 


By looking at the proofs of 12.2 and 12.6 one easily has the following result. 


Theorem 12.7. A variety V is congruence-permutable (respectively, congruence-distributive) 
iff Fy (Z, 9, Z) has permutable (respectively, distributive) congruences. 


For convenience in future discussions we introduce the following definitions. 


Definition 12.8. A ternary term p satisfying the conditions in 12.2 for a variety V is called 
a Mal’cev term for V, a ternary term M as described in 12.3 is a majority term for V, and a 
ternary term m as described in 12.5 is called a 2-minority term for V. 


The reader will find Mal’cev conditions for congruence-modular varieties in Day [1] below. 
REFERENCES 


. A. Day [1969] 

. B. Jénsson [1967] 

. A.I. Mal’cev [1954] 
. A.F. Pixley [1963] 
. W. Taylor [1973] 
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EXERCISES §12 
1. Verify the claim that Boolean algebras [Heyting algebras] are arithmetical. 


2. Let V be a variety of rings generated by finitely many finite fields. Show that V is 
arithmetical. 


3. Show that the variety of n-valued Post algebras is arithmetical. 


4. Show that the variety generated by the six-element ortholattice in Figure 19 is arith- 
metical. 
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0 
Figure 19 


813. The Center of an Algebra 


Smith [6] introduced a generalization to any algebra in a congruence-permutable variety of 
the commutator for groups. Hagemann and Herrmann [3] then showed that such commuta- 
tors exist for any algebra in a congruence-modular variety. Using the commutator one can 
define the center of such algebras. Another very simple definition of the center, valid for any 
algebra, was given by Freese and McKenzie [1], and we will use it here. 


Definition 13.1. Let A be an algebra of type F. The center of A is the binary relation 
Z(A) defined by: 
(a,b) € Z(A) 


iff for every p(x, Y1,---,;Yn) € T (2, Y1,---; Yn) and for every c1,..., Cn, di,-..,dn € A, 


lds Cip.et Oy)! SPE igaisdyy a plby Gis 235 hy) SOO ai. 25d 


Theorem 13.2. For every algebra A, the center Z(A) is a congruence on A. 


PROOF. Certainly Z(A) is reflexive, symmetric, and transitive, hence 7(A) is an equivalence 
relation on A. Next let f be an n-ary function symbol, and suppose (a;,b;) € Z(A), 1<i< 
n. Given a term p(Z, y1,---,;Ym) and elements c1,...,Cm,di,..-,dm of A, from the definition 
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of Z(A) we have 


P(f (a1, aa, aa 1 On), Cc) =a P(f (a1, a2, ee ; Qn), 
iff DT Wis Goeat Qa) e) = p(f(b1, a2,..-,@n), 


iff P(f (1, aa) Danas An), Cc) = P(f(b1, see Dats An), d) 
iff p(f(b1,..-,bn),@) = p(f (br, .--; bn), @); 


hence 


sO 
FP (Ois a sO ls f Oise; Ua) EZCAD! 
Thus Z(A) is indeed a congruence. 


Let us actually calculate the above defined center of a group and of a ring. 


EXAMPLE. Let G = (G,-,~1,1) be a group. If (a,b) € Z(G) then, with the term 
P(X, V1, Y2) = Y1° + Y2 and c € G, we have 


pasa *,6)=p(a,ea-"); 


hence 
s c) = pb, C, a), 


plo, a 
that is, 
G2 bsGSerbea. 
With c = 1 it follows that 
ao = baa: 
hence for c € G, 
a+-b-c=c-a'-b, 
consequently (a,b) is in the congruence associated with the normal subgroup N of G which 
is the usual group-theoretic center of G, i.e, N={gEG:h-g=g-h for he G}. 
Conversely, suppose N is the usual group-theoretic center of G. Then for any term 
P(L,Y1,--+;Yn) and elements a,b, c1,...,Cn,d1,...,dn € G, if a+b! € N, and if 


then 
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SO : 
p(b,€) = p(d, d) 
as a-b-! is central. So, by symmetry, if a-b-' € N then 


> 


p(a,é) =pla,d) iff p(b,2) = p(b,d), 

so (a,b) € Z(G). 
Thus 
Z(G) = {(a,b) € G?: (a-b")-c=c- (a-b") forc € G}. 
EXAMPLE. Let R = (R,+,-,—,0) be a ring. If (r,s) € Z(R) then, for t € R, 
PEt) ea) 20, 
hence replacing the underlined r by s we have 
(r—s)-t=0. 


Likewise 
t-(r—s) =0, 


so r—s € Ann(R), the annihilator of R. Conversely, if r— s € Ann(R) and p(z, y1,..-, Yn) 
is a term and c,...,Cn,d1,...,d, € R then from 


it follows that 
and thus 


By symmetry, we have 
Z(R) = {(r,s):r—s € Ann(R)}. 


Now we return to the fundamental theorem of centrality, namely the characterization of 
modules up to polynomial equivalence. 


Definition 13.3. Let A be an algebra of type ¥. To Fy add symbols a for each a € A, 
and call the new type F,, and let Ay be the algebra of type F4 which is just A with a 
nullary operation corresponding to each element of A. The terms of type F,4 are called the 
polynomials of A. We write p* for p“4. Two algebras Ay = (A, Fi) and Ay = (A, Ff), 
possibly of different types, on the same universe are said to be polynomially equivalent if 
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they have the same set of polynomial functions, i.e., for each polynomial p(x1,...,2%,) of Ay 
there is a polynomial q(x1,...,2,) of Ag such that p“! = q“?, and conversely. 


The following proof incorporates elegant arguments due to McKenzie and Taylor. 


Theorem 13.4 (Gumm, Hagemann, Herrmann). Let A be an algebra such that V(A) is 
congruence-permutable. Then the following are equivalent: 

(a) A is polynomially equivalent to a left R-module, for some R. 

(b) Z(A) = Va. 

(c) {(a,a) : a € A} ts a coset of a congruence on A x A. 


PROOF. (a) => (b): If A is polynomially equivalent to a module M = (M, +, —,0, (f;)rer), 
then for every term p(%, y1,---,Y%n) of A there is a polynomial 


q(x, V1, eats Yn) = f,(2) 7 flGi) Pagel Frn (Yn) +m 
of M such that 
po =4d 

Thus for a,b, c1,...,Cn,d1,...,dn € A, if 

DG Ci his Cy) SOG ics med Uy) 
then 

GOs Cindi gly "OW dign2 gd) 
hence if we subtract f,(a) from both sides, 

fr (cr) H+ + fon (Cn) +m = fr, (di) + +++ + fon (dn) +m, 

so if we add f,(b) to both sides, 

qg(b, Cj,--- Cp) = qg(b, di, tee Bs ie 
consequently 


eC eer pemerre om) ee 11 (sue, permenne f9 B 
By symmetry, 


p(a,@) = p(a,d) iff p(b,2) = p(b,d); 
hence Z(A) = Vy. 
(b) = (c): First note that X = {(a,a) : a € A} is a coset of some congruence on A x A 
iff it is a coset of O(X), the smallest congruence on A x A obtained by identifying X. Now, 


from §10 Exercise 7, 


O({(a, a) QE A}) _ t*(s({(p>*A (a, a), (c1, di), oar) (Cn, dn)), 


BP Cb, DY 4 Gig Ui ys ecg Cay GaN) SO UpCip newer pdt cus, e A 
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and p is a term })) U Aaya. 7 
Hence X is a coset of O(X) iff for every G,b,c1,...,Cn,di,...,dy, © A and every term 


P(@, Yi, ee Ge) 
p®*A((G,a), om dy), sueae ty \Cis dy)) € xX 


iff 
that is, 


Thus X is a coset of O(X) iff Z(A) = Va. 
(b) = (a): Given that Z(A) = Va, let p(x, y,z) be a Mal’cev term for V(A). Choose 
any element 0 of A and define, for a,b € A, 


Then 


a7 

Next observe that for a, b,c,d,e € A, 
p(p(a, a, a), d, p(b, €, €)) = p(p(a, d, b), e, p(G & €)); 
hence, as (e,c) € Z(A), we can replace the underlined e by c to obtain 
p(p(a, a, a), d, p(b, €, €)) = p(p(a, d, b), e, p(¢, ¢, ¢)), 


SO 
p(a, d, p(b, e, c)) = p(p(a, d, b), e, c). 


Setting d = e = 0, we have the associative law 
at+(b+c)=(a+b) +e. 
Next, 


a+ (—a) 


p(a, 0, p(0, a, 0)) 
p(p(a, 0, 0), a, 0) 
p(a, a, 0) 
0 
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By 
p(a, b, b) = p(b, b, a) 
and the fact that (0,b) € Z(A), we can replace the underlined b by 0 to obtain 


p(a, 0, b) = p(b, 0, @); 


hence 
a+b=b+a, 
so (A,+,—,0) is an Abelian group. 
Next we show that each n-ary term function p“(x1,...,2,) of A is affine for (A, +, —, 0), 
i.e., it is a homomorphism from (A,+,—,0)” to (A,+,—,0) plus a constant. Let a1,...,@n, 


by,...,0, € A. Then 
p(ai + 0,...,@n +0) + p(0,...,0) = p(0+0,...,0+0)+ p(ai,..., an). 
As (0,61) € Z(A) we can replace the underlined 0’s by 6; to obtain 
p(ay + by, dg +0,...,a, +0) + p(0,...,0) = p(0+b;,0+0,...,0 +0) + p(ay,..., an). 
Continuing in this fashion, we obtain 


p(a, + b4,.--,4n + b,) + p(0,...,0) = p(br,..., bn) + p(ai,.--, An) 
=pl@ijas2 (Aq) POs. Ua): 


Thus p“(21,...,2n) —p*(0,...,0) is a group homomorphism from (A, +, —, 0)” to (A, +, —, 0). 
To construct the desired module, let R be the set of unary functions p“(z,c1,...,€,) on 
A obtained by choosing terms p(x, y1,---,Y%n) and elements ci,...,¢, € A such that 


(UW emer cow nO 
For such unary functions we have 


p(at+b,¢1,---,€n) = p(@,C1,---,Cn) + p(b,0,...,0) — p(0,...,0) 


and 
p(b, c1,---,Cn) = p(b, 0,...,0) + p(0, c1,..-, en) — p(0,...,0) 
p(b,0,...,0) — p(0,..-,0); 
hence 
Pla AP OYE any) Dil Cy ea) OCH oiny Ca (x) 


Thus each member of R is an endomorphism of (A, +, —, 0). 


813. The Center of an Algebra 97 


Clearly R is closed under composition 0, and for r,s € R define r + s and —r by 


(r + s)(a) = r(a) + s(a) = p(r(a), 0, s(@)) 
(—r)(a@) = —r(a) = p(0,r(a), 0). 
Then r+s,—r € R. Let 0 be the constant function on A with value 0, and let 1 be the 
identity function on A. Then 0,1 € R as well. We claim that R = (R,+,-,—,0,1) isa 
ring. Certainly (R,+,—,0) is an Abelian group as the operations are defined pointwise in 
the Abelian group (A,+,-—,0), and (R,-,1) is a monoid. Thus we only need to look at the 
distributive laws. If we are given r,s,t © R, then 


(7 + s) ot](a) = (r + 5)(#(a)) 

r(t(a)) + s(t(a@)) 

= (rot)(a) + (sot)(a) 
=(rot+sot)(a); 


hence 
(r+s)ot=rot+sot. 
Also 
[ro (s + t)](a) =r((s + t)(a)) 
= r(s(a) + t(a)) 
= r(s(a)) + r(t(a)) (by (*) above) 
= (r ° s)(a) + (r ° t)(a) 
(ros+rot)(a); 
hence 


o(s+t)=(ros)+(rot). 
This shows R is a ring. 
Now to show that M = (A,+,—,0, (r),cr) is a left R-module, we only need to check the 
laws concerning scalar multiplication. So let r,s € R, a,b € A. Then 
(r+ s)(a) =r(a) + s(a) (by definition) 
r(a+b) =r(a) +r(b) by («)) 
(ro s)(a) = r(s(a)). 
Thus M is a left R-module (indeed a unitary left R-module). 


The fundamental operations of M are certainly expressible by polynomial functions of A. 
Conversely any n-ary fundamental operation f“(21,..., 2») of A satisfies, for a1,...,an € A, 


Fl Giese Ga) HF O30) SF aa Oe FO 0) 
+---+(f(0,...,an) — f(0,...,0)). 
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As 


fr, = f"(0, -..50,;2) — fA0,..,0) Ee R 
it follows that 
fA (ai, ...,¢n) =ri(ei) +-+-+ral(en) + £(0,...,0); 


hence each fundamental operation of A is a polynomial of M. This suffices to show that A 
and M are polynomially equivalent. 


Actually one only needs to assume V(A) is a congruence-modular in Theorem 13.4 (see 
(4) or (7) below). 
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EXERCISES 813 


1. If A belongs to an arithmetical variety, show that Z(A) = Ay. [Hint: if (a,b) € Z(A) 
use m(a, b,a) = m(b, b, a).] 


2. Show that (a,b) € Z([],<, Ai) iff (a(z), b(@)) € Z(A;) for a € I. 
3. If A < B and Z(B) = Vz, show Z(A) = Va. 


4. If B € H(A) and A is in a congruence-permutable variety, show that Z(A) = Va 
implies Z7(B) = Vg. Conclude that in a congruence-permutable variety all members 
A with Z7(A) = V4 constitute a subvariety. 


5. Suppose A is polynomially equivalent to a module. If p(z,y,z),¢(x,y,z) are two 
Mal’cev terms for A, show p“(z, y, z) = q4(z, y, z). 
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6. (Freese and McKenzie). Let V be a congruence permutable variety such that Z(A) = 
Va for every A € V. Let p(x, y, z) be a Mal’cev term for V. Define R by 


RAG yy) Se lye G) 27, B= a 


(Note that if r(@, 7) = s(Z,7), then r(Z,z) = F iff s(%,T) = ZT.) Define the operations 
+,-,—,0,1 on R by 


r(z,¥) + 8(@,%) = P(r, 9), 9, (9) 
re, y) : s(F, y) a r(s(Z, y)9 
TUE, y) = PY, T(z, y),¥) 
v=79 
1=f. 


Verify that R = (R,+,-,—,0,1) is a ring with unity. Next, given an algebra A € V 
and n € A, define the operations +, —,0, (f;-)rer on A by 


Now verify that (A,+,—,0,(f;)rer) is a unitary R-module, and it is polynomially 
equivalent to A. 


§14. Equational Logic and Fully Invariant Congruences 


In this section we explore the connections between the identities satisfied by classes of alge- 
bras and fully invariant congruences on the term algebra. Using this, we can give a complete 
set of rules for making deductions of identities from identities. Finally, we show that the 
possible finite sizes of minimal defining sets of identities of a variety form a convex set. 


Definition 14.1. A congruence @ on an algebra A is fully invariant if for every endomor- 
phism a on A, 
(a,b) € 6 = (aa, ab) € 6. 


Let Congy(A) denote the set of fully invariant congruences on A. 


Lemma 14.2. Conpy(A) is closed under arbitrary intersection. 
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PROOF. (Exercise.) 


Definition 14.3. Given an algebra A and S C A x A let Op,(9) denote the least fully 
invariant congruence on A containing S. The congruence Of;(S) is called the fully invariant 
congruence generated by S. 


Lemma 14.4. If we are given an algebra A of type F then Op, is an algebraic closure 
operator on A x A. Indeed, Op, is 2-ary. 


PRooF. First construct A x A, and then to the fundamental operations of A x A add the 
following: 


(a, a) forae A 


a, d) iL b= 
a, b) otherwise 


€,((a, b)) = (aa, ob) for 0 an endomorphism of A. 


Then it is not difficult to verify that @ is a fully invariant congruence on A iff 6 is a subuniverse 
of the new algebra we have just constructed. Thus Of, is an algebraic closure operator. 

To see that Op; is 2-ary let us define a new algebra A* by replacing each n-ary funda- 
mental operation f of A by the set of all unary operations of the form 


f(a, o 2+) Qj-1,%, Qj41,.-.-.-, iy) 
where @j,...,@j—1, @i41,---,@n are elements of A. 
Claim. Con A = Con A*. 


Clearly 6 € Con A = @ © Con A®*. For the converse suppose that 6 € Con A* and 
f © F,. Then for 


(a;,b;) € 8, Ll<i<n, 
we have 
(f(a, +++5Qn-1; a) f(a, +++) Qn-1, bn)) € 60 
CRU Gas2:5.4:9 On SASOn ef (Ot; “Dias Dye) Cc 6 
(f (ai, bo, cas , ba), f(b, pak bn)) E 0; 
hence 


(f(a1,---,@n), f(b1,.--,0n)) € 8. 
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Thus 
6€Con A. 


If now we go back to the beginning of the proof and use A* instead of A, but keep the e,’s 
the same, it follows that Of; is the closure operator Sg of an algebra all of whose operations 
are of arity at most 2. Then by 4.2, Op; is a 2-ary closure operator. 


Definition 14.5. Given a set of variables X and a type J, let 
T:Id(X) — T(X) x T(X) 


be the bijection defined by 
T(p © q) = (p,q). 


Lemma 14.6. For K a class of algebras of type F and X a set of variables, T(Idx(X)) ts 
a fully invariant congruence on T(X). 


PRooF. As 


pp €lIdx(x) forpEe T(x) 
pqé€ Idk(X) > q= pe Idx(X) 
pegqq~réldk(X) > pre Idx(X) 


it follows that T(Idx(X)) is an equivalence relation on T(X). Now if 
pia €ldx(X) forl<i<n 
and if f € F, then it is easily seen that 
f(Pi,-- +5 Pn) © F(M,---5 dn) € Id (X), 
so T(Id«(X)) is a congruence relation on T(X). Next, if a is an endomorphism of T(X) and 
p(1,--+;En) ¥ q(@1,.--, Ln) € Idx (X) 
then it is again direct to verify that 


p(an,...,A%n) ¥ q(axi,...,@I%n) € Idx (X); 


hence T(Idx(X)) is fully invariant. 


Lemma 14.7. Given a set of variables X and a fully invariant congruence 0 on T(X) we 
have, for p= q € Id(X), 
T(X)/@ Rp eq > (p,q) € 8. 
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Thus T(X)/0 is free in V(T(X)/8). 


PROOF. (=>) If 


then 


(<=) Given rj,... 


hence 


Thus 


TX Opies itn.) YG iy oe On) 
Slay PO a4 se ta lO fly aes Tale) 
SDs en) = Olay oe Te 
Spl tise odin Gl iy erg) ee 
=> (p,q) € 0. 


,?m © T(X) we can find an endomorphism ¢ of T(X) with 


Ea Sa Lees 


PGi sco ee Ss )y Gg sy ea) SO 

Sep higs ote 60 Miss oe ee) SG 
SLO Tis ness Tals Ciseeeota)) €o 

SS D/O, <3 T rl 0) = Gi One taf @): 


T(X)/OEpeg. 


For the last claim, given p ® q € Id(X), 


so T(X)/@ is free in V(T(X)/0) by 11.4. 


(p,q) C0 T(X)/O = peg 
= V(T(X)/0) EE pq (by 11.3), 


Theorem 14.8. Given a subset © of Id(X), one can find a K such that 


D = Idx (X) 


iff T(%) is a fully invariant congruence on T(X). 


PROOF. (=) This was proved in 14.6. 
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(<=) Suppose 7(%) is a fully invariant congruence 6. Let K = {T(X)/6}. Then by 14.7 


KE peas (p,q) € 
SpxrqeX. 


Thus © = Idx (X). 


Definition 14.9. A subset © of Id(X) is called an equational theory over X if there is a 
class of algebras K such that 
De = Idye(X): 


Corollary 14.10. The equational theories (of type ¥) over X form an algebraic lattice which 
is isomorphic to the lattice of fully invariant congruences on T(X). 


PrRooF. This follows from 14.4 and 14.8. 


Definition 14.11. Let X be a set of variables and © a set of identities of type F with 
variables from X. For p,q € T(X) we say 


LE pyq 
(read: “XS yields p = q”) if, given any algebra A, 


AEX implies AEpsq. 


Theorem 14.12. If is a set of identities over X and pq is an identity over X, then 


“Epsq > (p,q) € Or(Td). 


PROOF. Suppose 
A’S X: 


Then as T(Ida(X)) is a fully invariant congruence on T(X) by 14.6, we have 
Or (7X) e TIda(X); 


hence 
(p, q) E Op (7X) >A KH pred, 


SO 
(p,q) € On(TX) > UE pg. 


Conversely, by 14.7 
T(X)/Or(7d) —& &, 
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so if 


then 


hence by 14.7, 


In the proof of 14.4 we gave an explicit description of the operations needed to construct 
the fully invariant closure Of;(S) of a set of ordered pairs S from an algebra. This will lead 
to an elegant set of axioms and rules of inference for working with identities. 


Definition 14.13. Given a term p, the subterms of p are defined by: 
(1) The term p is a subterm of p. 
(2) If f(pi,.-.,pPn) is a subterm of p and f € F, then each p; is a subterm of p. 


Definition 14.14. A set of identities over X is closed under replacement if given any 
p&qé€ and any term r € T(X), if p occurs as a subterm of r, then letting s be the result 
of replacing that occurrence of p by g, we have rs € &. 


Definition 14.15. A set of identities © over X is closed under substitution if for each p & q 
in © and for r; € T(X), if we simultaneously replace every occurrence of each variable x; in 
pq by r;, then the resulting identity is in X. 


Definition 14.16. If © is a set of identities over X, then the deductive closure D(X) of U 
is the smallest subset of Id(X) containing © such that 

pp e€ D(%) for pe T(X) 

p=qe DS) > qx pe D(d) 

pyqgqrre D(X) => pr re D(X) 

D(%) is closed under replacement 

D(%) is closed under substitution. 


Theorem 14.17. Given © C Id(X), pq € Id(X), 


UE pyqs pxqe D(X). 


ProoF. The first three closure properties make 7 D(X) into an equivalence relation contain- 
ing TY, the fourth makes it a congruence, and the last closure property says TD(%) is a fully 
invariant congruence. Thus 
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However 7~'Op1(T™) has all five closure properties and contains U; hence 
TD(=) = Orr(7). 
Thus 


“Ep qs (p,q) € Om(7Td) — (by 14.12) 
oprqe D(x). 


Thus we see that using only the most obvious rules for working with identities we can 
derive all possible consequences. From this we can set up the following equational logic. 


Definition 14.18. Let © be a set of identities over X. For p = q € Id(X) we say 
Depry g 
(read “XS proves p  q”) if there is a sequence of identities 
Pi ~ %1,--+5Pn + Qn 


from Id(X) such that each p; * q; belongs to %, or is of the form p ®& p, or is a result of 
applying any of the last four closure rules of 14.16 to previous identities in the sequence, 
and the last identity py, © dn is p © g. The sequence p, ¥ q,.--,DPn © dn is called a formal 
deduction of p & q, and n is the length of the deduction. 


Theorem 14.19 (Birkhoff: The Completeness Theorem for Equational Logic). Given % C 
Id(X) and pq € Id(X) we have 


Depry g eee pag 


PROOF. Certainly 
LeEpyq > pxrqe D(d) 


as we have used only properties under which D(%) is closed in the construction of a formal 
deduction py © q1,.--;Pn © Qn Of p& q. 
For the converse of this, first it is obvious that 


“NEpeq for prqek 


and 
“LEpsep for pEeT(Xx). 
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If 
DeEpryg 


then there is a formal deduction 


Pi ~~ Q,-++5Pn © An 
of p = q. But then 
Pi M,-+++,Pn © Gn; dn ~ Pn 


is a formal deduction of q & p. 
If 
“ur pegq, ur grr 


let 
Pi ~ Q,-++5Pn © An 


be a formal deduction of p & q and let 
Pi O1,--+> Pr I 


be a formal deduction of ¢ & r. 
Then 


Pi © Q1,--->Pn © Ins Pi © G++ +> Pr I: Pn © Ur 


is a formal deduction of p & r. 
If 
DeEpryeg 


let 
Pi Q,--+5Pn © dn 


be a formal deduction of p & q. Let 


bes saps as.) 


denote a term with a specific occurrence of the subterm p. Then 


Di Sites Da OAs (gC PRC a Lek g Comes Sremaes 


is a formal deduction of 


Finally, if 
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and f € F, then by writing the formal deductions of each p; & q; in succession and adding 
replacement steps to obtain the identity f(p1,...,Dn) © f(u,---;Gn) at the end we have a 
formal deduction of the latter, viz., 


LO Gig eee Se ig ct De ay) Dinesia Oa Bt aoa Dain eens 
Thus 
DS) Cipags Shp a 
hence 
D(X) = {pe q: Ub peg}, 
so by 14.17 


MEpygqs dkpeg¢. 


The completeness theorem gives us a two-edged sword for tackling the study of conse- 
quences of identities. When using the notion of satisfaction, we look at all the algebras 
satisfying a given set of identities, whereas when working with + we can use induction argu- 
ments on the length of a formal deduction. 


EXAMPLES. (1) An identity p = q is balanced if each variable occurs the same number of 
times in p as in q. If © is a balanced set of identities then using induction on the length of a 
formal deduction we can show that if © + p » q then p & q is balanced. [This is not at all 
evident if one works with the notion F .] 

(2) A famous theorem of Jacobson in ring theory says that if we are given n > 2, if U 
is the set of ring axioms plus x” = x, then & | x-y & y- a. However there is no known 
routine way of writing out a formal deduction, given n, of x-y ~ y- x. (For special n, such 
as n = 2,3, this is a popular exercise.) 


Another application of fully invariant congruences in the study of identities is to show 
the existence of minimal subvarieties. 


Definition 14.20. A variety V is trivial if all algebras in V are trivial. A subclass W of a 
variety V which is also a variety is called a subvariety of V. V is a minimal (or equationally 
complete) variety if V is not trivial but the only subvariety of V not equal to V is the trivial 
variety. 


Theorem 14.21. Let V be a nontrivial variety. Then V contains a minimal subvariety. 


Proor. Let V = M(X), © C Id(X) with X infinite (see 11.8). Then Idy(X) defines V, 
and as V is nontrivial it follows from 14.6 that 7(Idy(X)) is a fully invariant congruence on 
T(X) which is not V. As 

V = Ori((x, )) 
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for any z,y € X with x F y, it follows that V is finitely generated (as a fully invariant 
congruence). This allows us to use Zorn’s lemma to extend 7(Idy(X)) to a maximal fully 
invariant congruence on T(X), say 6. Then in view of 14.8, 7710 must define a minimal 
variety which is a subvariety of V. 


EXAMPLE. The variety of lattices has a unique minimal subvariety, the variety generated 
by a two-element chain. To see this let V be a minimal subvariety of the variety of lattices. 
Let L be a nontrivial lattice in V. As L contains a two-element sublattice, we can assume L 
is a two-element lattice. Now V(L) is not trivial, and V(L) C V, hence V(L) = V. [We shall 
see in IV§8 Exercise 2 that V is a variety of all distributive lattices. | 


We close this section with a look at an application of Tarski’s irredundant basis theorem 
to sizes of minimal defining sets of identities. 


Definition 14.22. Given a variety V and a set of variables X let 

IrB(Idy(X)) = {|| : © is a minimal finite set of identities over X defining V}. 
Theorem 14.23 (Tarski). Given a variety V and a set of variables X, IrB(Idy(X)) is a 
conver set. 


ProoFr. For © C Idy(X), © & Idy(X) implies 


Ori (TX) = TIdy(X). 


As Op is 2-ary by 14.4, from 4.4 we have the result. 


REFERENCES 
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EXERCISES §14 


1. Show that the fully invariant congruences on an algebra A form a complete sublattice 
of Con A. 


2. Show that every variety of mono-unary algebras is defined by a single identity. 
3. Verify the claim that consequences of balanced identities are again balanced. 


4. Given a type F and a maximal fully invariant congruence @ on T(z,y) show that 
V(T(a, y)/0) is a minimal variety, and every minimal variety is of this form. 
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10. 


i 


12, 


. If V isa minimal variety of groups show that Fy (%) is nontrivial, hence V = V(Fy(Z)). 


Determine all minimal varieties of groups. 


. Determine all minimal varieties of semigroups. 


. If p(x) is a term and » is a set of identities such that © — p(x) x and UE p(x) & 


p(y), show that © E x & y; hence M(%) is a trivial variety. 


. Let f,g be two unary operation symbols. Let N be the set of natural numbers, and 


for I C N let 
Ur = {fof"g'(«) sain Ee DU {fof"g'(x)  fof"g'(y) in ¢ Th. 


Show that M(X;) is not a trivial variety, but for J A J, M(X7) A M(%)) is trivial. 
Conclude that there are 2” minimal varieties of bi-unary algebras; hence some variety 
of bi-unary algebras is not defined by a finite set of identities. 


. If a variety V is defined by an infinite minimal set of identities show that V has at 


least continuum many varieties above it. 


(The compactness theorem for equational logic) If a variety V is defined by a finite set 
of identities, then for any other set © of identities defining V show that there is a finite 
subset No of & which defines V. 


Given © C Id(X) let an elementary deduction from & be one of the form 
i ea oy | Per BC kel Cee of | Par 


which is an identity obtained from p ¥ q, where p © q or g & p € &, by first substituting 
for some variable x the term ep, where € is an endomorphism of T(X), and then 
replacing some occurrence of ep in a term by eq. Show that D(X) is the set of r = s 
such that r = s or there exist elementary deductions r; ~ s;, 1 <7 < n, with 
r=71, §=Tisi, 1<t<n, and s, =s, provided X is infinite. 


Write out a formal deduction of x-y + y- x from the ring axioms plus 7- x & a. 


Chapter III 


Selected Topics 


Now that we have covered the most basic aspects of universal algebra, let us take a brief look 
at how universal algebra relates to two other popular areas of mathematics. First we discuss 
two topics from combinatorics which can conveniently be regarded as algebraic systems, 
namely Steiner triple systems and mutually orthogonal Latin squares. In particular we will 
show how to refute Euler’s conjecture. Then we treat finite state acceptors as partial unary 
algebras and look at the languages they accept—this will include the famous Kleene theorem 
on regular languages. 


81. Steiner Triple Systems, Squags, and Sloops 


Definition 1.1. A Steiner triple system on a set A is a family § of three-element subsets of 
A such that each pair of distinct elements from A is contained in exactly one member of 8. 
|A| is called the order of the Steiner triple system. 

If |A] = 1 then § = @, and if |A| = 3 then S$ = {A}. Of course there are no Steiner triple 
systems on A if |A| = 2. The following result gives some constraints on |A] and |S]. (Actually 
they are the best possible, but we will not prove this fact.) 


Theorem 1.2. Jf 5 is a Steiner triple system on a finite set A, then 
(a) [8] = [A] - JA] — 1)/6 
(b) |A| = 1 or 3(mod 6). 


ProoF. For (a) note that each member of § contains three distinct pairs of elements of A, 
and as each pair of elements appears in only one member of §, it follows that the number of 
pairs of elements from A is exactly 3]8], i-e., 


(tl) — ai 


111 


112 III Selected Topics 


To show that (b) holds, fix a € A and let 7j,...,7;% be the members of § to which 
a belongs. Then the doubletons T; — {a},...,7, — {a} are mutually disjoint as no pair of 
elements of A is contained in two distinct triples of 8; and A—{a} = (T1—{a})U- --U(T,—{a}) 
as each member of A — {a} is in some triple along with the element a. Thus 2||A| — 1, so 
|A| = 1 (mod 2). From (a) we see that |A] = 0 or 1 (mod 3); hence we have |A| = 1 or 
3 (mod 6). 


Thus after |A] = 3 the next possible size |A| is 7. Figure 20 shows a Steiner triple system 
of order 7, where we require that three numbers be in a triple iff they lie on one of the lines 
drawn or on the circle. The reader will quickly convince himself that this is the only Steiner 
triple system of order 7 (up to a relabelling of the elements). 


ae 4 5 


Figure 20 


Are there some easy ways to construct new Steiner triple systems from old ones? If we 
convert to an algebraic system it will become evident that our standard constructions in 
universal algebra apply. A natural way of introducing a binary operation on A is to require 

a-b=c if {a,b,c} ES. (x) 
Unfortunately this leaves a-a undefined. We conveniently get around this by defining 


a-a=a. («) 


Although the associative law for - fails already in the system of order 3, nonetheless we have 
the identities 
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(Sql) g-aea 

(Sq2) v-yXy-ax 

(Sq3) w-(@-y) © y. 

Definition 1.3. A groupoid satisfying the identities (Sql)—(Sq3) above is called a squag (or 
Steiner quasigroup). 


Now we will show that the variety of squags precisely captures the Steiner triple systems. 


Theorem 1.4. /f (A,-) is a squag define 8 to be the set of three-element subsets {a,b,c} 
of A such that the product of any two elements gives the third. Then & is a Steiner triple 
system on A. 


PROOF. Suppose a- b = c holds. Then 
a (a+b) Sa+c, 


so by (Sq3) 


Continuing, we see that the product of any two of a,b,c gives the third. Thus in view of 
(Sql), if any two are equal, all three are equal. Consequently for any two distinct elements 
of A there is a unique third element (distinct from the two) such that the product of any 
two gives the third. Thus § is indeed a Steiner triple system on A. 


Another approach to converting a Steiner triple system S on A to an algebra is to adjoin 
a new element, called 1, and replace («*) by 


This leads to a groupoid with identity (A U {1},-,1) satisfying the identities 
(SH) eae ti 

(S12) a-yxry-a2 

(S13) 2 - (a@-y) = y. 


Definition 1.5. A groupoid with a distinguished element (A, -, 1) is called a sloop (or Steiner 
loop) if the identities (S11)—(SI8) hold. 


Theorem 1.6. /f (A,-,1) is a sloop and |A| > 2, define § to be the three-element subsets 
of A— {1} such that the product of any two distinct elements gives the third. Then 8 is a 
Steiner triple system on A — {1}. 


PROOF. (Similar to 1.4.) 
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§2. Quasigroups, Loops, and Latin Squares 


A quasigroup is usually defined to be a groupoid (A,-) such that for any elements a,b € A 
there are unique elements c,d satisfying 


a-c=)b 
d-a=b. 


The definition of quasigroups we adopted in II§1 has two extra binary operations \ and /, 
left division and right division respectively, which allow us to consider quasigroups as an 
equational class. Recall that the axioms for quasigroups (A, /,-, \) are given by 


a\(c-y)ey  (x-y)/yru 
a(vly)sy  (e/y)-y eo. 


To convert a quasigroup (A,-) in the usual definition to one in our definition let a/b be 
the unique solution c of c- b =a, and let a\b be the unique solution d of a-d = b. The four 
equations above are then easily verified. Conversely, given a quasigroup (A, /,-,\) by our 
definition and a,b € A, suppose c is such that a-c = b. Then a\(a-c) = a\b; hence c = a\b, 
so only one such c is possible. However, a - (a\b) = b, so there is one such c. Similarly, we 
can show that there is exactly one d such that d- a = b, namely d = b/a. Thus the two 
definitions of quasigroups are, in an obvious manner, equivalent. 

A loop is usually defined to be a quasigroup with an identity element (A,-,1). In our 
definition we have an algebra (A, /,-,\,1); and such loops form an equational class. 

Returning to a Steiner triple system 8 on A we see that the associated squag (A,-) is 
indeed a quasigroup, for if a-c = b then a: (a-c) = a-b, soc =a-b, and furthermore 
a-(a-b) = b; hence if we are given a,b there is a unique c such that a-c = b. Similarly, 
there is a unique d such that d-a = b. In the case of squags we do not need to introduce the 
additional operations / and \ to obtain an equational class, for in this case / and \ are the 
same as -. Squags are sometimes called idempotent totally symmetric quasigroups. 

Given any finite groupoid (A,-) we can write out the multiplication table of (A,-) in a 
square array, giving the Cayley table of (A,-) (see Figure 21). If we are given the Cayley 
table for a finite groupoid (A,-), it is quite easy to check whether or not (A,-) is actually a 
quasigroup. 


Figure 21 
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Theorem 2.1. A finite groupoid A is a quasigroup iff every element of A appears exactly 
once in each row and in each column of the Cayley table of (A,-). 


PRooF. If we are given a,b € A, then there is exactly one c satisfying a-c = b iff b occurs 
exactly once in the ath row of the Cayley table of (A, -); and there is exactly one d such that 
d-a = b iff b occurs exactly once in the ath column of the Cayley table. 


Definition 2.2. A Latin square of order n is an n x n matrix (a;;) of elements from an n 
element set A such that each member of A occurs exactly once in each row and each column 
of the matrix. (See Figure 22 for a Latin square of order 4.) 


Figure 22 


From Theorem 2.1 it is clear that Latin squares are in an obvious one-to-one correspon- 
dence with quasigroups by using Cayley tables. 
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83. Orthogonal Latin Squares 


Definition 3.1. If (a;;) and (b;;) are two Latin squares of order n with entries from A 
with the property that for each (a,b) € A x A there is exactly one index ij such that 
(a, b) = (a;;,;;), then we say that (a;;) and (b;;) are orthogonal Latin squares. 


Figure 23 shows an example of orthogonal Latin squares of order 3. In the late 1700’s 
Euler was asked if there were orthogonal Latin squares of order 6. Euler conjectured: if 
n = 2 (mod 4) then there do not exist orthogonal Latin squares of order n. However he was 
unable to prove even a single case of this conjecture for n > 2. In 1900 Tarry verified the 
conjecture for n = 6 (this is perhaps surprising if one considers that there are more than 
800 million Latin squares on a set of six elements). Later Macneish gave a construction 
of orthogonal Latin squares of all orders n where n # 2 (mod 4). Then in 1959-60, Bose, 
Parker, and Shrikhande showed that n = 2,6 are the only values for which Euler’s conjecture 
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is actually true! Following the elegant presentation of Evans we will show, by converting 
orthogonal Latin squares into algebras, how to construct a pair of orthogonal Latin squares 
of order 54, giving a counterexample to Euler’s conjecture. 


Figure 23 


In view of §2, two orthogonal Latin squares on a set A correspond to two quasigroups 
(A, /,-,\) and (A, ¢,0,%) such that the map (a,b) + (a-b,aob) is a permutation of A x A. 
For a finite set A this will be a bijection iff there exist functions *, and *, from A x A to A 
such that 


*(a-b,aob) =a 
*,(a-b,aob) =b. 
Thus we are led to the following algebraic structures. 


Definition 3.2 (Evans). A pair of orthogonal Latin squares is an algebra 


(A, a % ve g, °, &, *], *y) 
with eight binary operations such that 
(i) (A, fs 5 \) Is a quasigroup 
(ii) (A, 4,0, %) is a quasigroup 
(iii) *(r@-y,roy) a 
(iv) x,(x-y, roy) &y. 


The order of such an algebra is the cardinality of its universe. Let POLS be the variety of 
pairs of orthogonal Latin squares. 


Now let us show how to construct a pair of orthogonal Latin squares of order n for any 
n which is not congruent to 2 (mod 4). 


Lemma 3.3. [fq is a prime power and q > 3, then there is a member of POLS of order q. 


Proor. Let (K,+,-) be a finite field of order q, and let e1,e2 be two distinct nonzero 
elements of K. Then define two binary operations 0, and QO, on K by 


aO,b=e;-at+b. 
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Note that the two groupoids (K,O;) and (K,O») are actually quasigroups, for aD;c = b 
holds iff c = b — e;- a, and dO,a = b holds iff d = e;'- (b— a). Also we have that 


(a 1b, a ab) =e 1d,c 2d) 


implies 
ey:at+b=e,-ct+d 
é2°atb=e,:c+d; 
hence 
ey —c)=d 
i) —c)=d- 


and thus, as e;  é9, 

G—€ and bea; 
Thus the Cayley tables of (A, 0) and (K, Os) give rise to orthogonal Latin squares of order 
q. 


Theorem 3.4. [fn = 0,1, or 3 (mod 4), then there is a pair of orthogonal Latin squares of 
order n. 


Proor. Note that n = 0,1 or 3 (mod 4) iff n = 2°%pf'---p?* with a 4 1, a; > 1, and 
each p; is an odd prime. The case n = 1 is trivial, and for n > 3 use 3.3 to construct 
Ao, Ai,..., Ax in POLS of order 2°, p\*,...,p,* respectively. Then Ag x Ay X--: x Ax is 
the desired algebra. 


To refute Euler’s conjecture we need to be more clever. 


Definition 3.5. An algebra (A, F) is a binary algebra if each of the fundamental operations 
is binary. A binary algebra (A, F’) is idempotent if 


f(u,@) & x 
holds for each function symbol f. 
Definition 3.6. Let IPOLS be the variety of idempotent algebras in POLS. 


Our goal is to show that there is an idempotent pair of orthogonal Latin squares of order 
54. We construct this algebra by using a block design obtained from the projective plane of 
order 7 to paste together some small members of [POLS which come from finite fields. 


Definition 3.7. A variety V of algebras is binary idempotent if 
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(i) the members of V are binary idempotent algebras, and 
(ii) V can be defined by identities involving at most two variables. 


Note that IPOLS is a binary idempotent variety. 
Definition 3.8. A 2-design is a tuple (B, B,,..., B,) where 


(i) B is a finite set, 
) each B; is a subset of B (called a block), 
(iii) |B;| > 2 for all 7, and 
) 


each two-element subset of B is contained in exactly one block. 


The crucial idea is contained in the following. 


Lemma 3.9. Let V be a binary idempotent variety and let (B, By,..., By) be a 2-design. 
Let n = |B|, n; = |B;|. If V has members of size nj, 1 <i <k, then V has a member of 
Size Ths 


Proor. Let A; € V with |A;| = n;. We can assume A; = B;. Then for each binary function 
symbol f in the type of V we can find a binary function f? on B such that when we restrict 
f® to B; it agrees with f4* (essentially we let f? be the union of the f*:). As V can be 
defined by two variable identities p(x, y) © g(x,y) which hold on each A,, it follows that we 
have constructed an algebra B in V with |B| =n. 


Lemma 3.10. [fq 1s a prime power and q > 4, then there is a member of IPOLS of size q. 
In particular, there are members of sizes 5, 7, and 8. 


Proor. Again let K be a field of order gq, let €1,e2 be two distinct elements of AK — {0,1}, 
and define two binary operations 0,0, on K by 


aHjb = e;-a+ (1-4) - 6. 


We leave it to the reader to verify that the Cayley tables of (K,O,) and (K, Og) give rise to 
an idempotent pair of orthogonal Latin squares. 


Now we need a construction from finite projective geometry. Given a finite field F’ of 
cardinality n we form a projective plane P,, of order n by letting the points be the subsets of 
F® of the form U where U is a one-dimensional subspace of F? (as a vector space over F), 
and by letting the lines be the subsets of F® of the form V where V is a two-dimensional 
subspace of F3. One can readily verify that every line of P,, contains n+ 1 points, and every 
point of P,, “belongs to” (i.e., is contained in) n + 1 lines; and there are n? + n+ 1 points 
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and n?+n-+1 lines. Furthermore, any two distinct points belong to exactly one line and 
any two distinct lines meet in exactly one point. 


Lemma 3.11. There is a 2-design (B, B,,..., By) with |B| = 54 and |B;| € {5,7,8} for 
VAs he. 


PROOF. Let 7 be the projective plane of order 7. This has 57 points and each line contains 8 
points. Choose three points on one line and remove them. Let B be the set of the remaining 
54 points, and let the B; be the sets obtained by intersecting the lines of 7 with B. Then 
(B, Bi,..., By) is easily seen to be a 2-design since each pair of points from B lies on a 
unique line of 7, and |B;| € {5,7, 8}. 


Theorem 3.12. There is an idempotent pair of orthogonal Latin squares of order 54. 


PRooF. Just combine 3.9, 3.10, and 3.11. 
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84. Finite State Acceptors 


In 1943 McCulloch and Pitts developed a model of nerve nets which was later formalized as 
various types of finite state machines. The idea is quite simple. One considers the nervous 
system as a finite collection of internal neurons and sensory neurons and considers time as 
divided into suitably small subintervals such that in each subinterval each neuron either 
fires once or is inactive. The firing of a given neuron during any one subinterval will send 
impulses to certain other internal neurons during that subinterval. Such impulses are either 
activating or deactivating. If an internal neuron receives sufficiently many (the threshold of 
the neuron) activating impulses and no deactivating impulses in a given subinterval, then it 
fires during the next subinterval of time. The sensory neurons can only be excited to fire 
by external stimuli. In any given subinterval of time, the state of the network of internal 
neurons is defined by noting which neurons are firing and which are not, and the input during 
any given subinterval to the network is determined by which sensory neurons are firing and 
which are not. We call an input during a subinterval of time a letter, the totality of letters 
constituting the alphabet. A sequence of inputs (in consecutive subintervals) is a word. A 
word is accepted (or recognized) by the neural network if after the sensory neurons proceed 
through the sequence of inputs given by the letters of the word the internal neurons at some 
specified number of subintervals later are in some one of the so-called accepting states. 

In his 1956 paper, Kleene analyzed the possibilities for the set of all words which could 
be accepted by a neural network and showed that they are precisely the regular languages. 
Later Myhill showed the connection between these languages and certain congruences on the 
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monoid of words. Let us now abstract from the nerve nets, where we consider the states as 
points and the letters of the alphabet as functions acting on the states, i.e., if we are in a 
given state and read a given letter, the resulting state describes the action of the letter on 
the given state. 


Definition 4.1. A finite state acceptor (abbreviated f.s.a.) of type F (where the type is 
finite) is a 4-tuple A = (A, Fao, Ao), where (A, F’) is a finite unary algebra of type F, ag € A, 
and Ap C A. The set A is the set of states of A, ao is the initial state, and Ag is the set of 
final states. 


Definition 4.2. If we are given a finite type F of unary algebras, let (F*,-, 1) be the monoid 
of strings on F. Given a string w € F*, an f.s.a. A of type F, and an element a € A, let w(a) 
be the element resulting from applying the “term” w(x) to a; for example if w = fg then 


w(a) = f(g(@)), and 1(a) = a. 


Definition 4.3. A language of type F is a subset of F*. A string w from F* is accepted by 
an f.s.a. A = (A, Fd, Ao) of type F if w(ag) € Ap. The language accepted by A, written 
L(A), is the set of strings from F* accepted by A. (“Language” has a different meaning in 
this section from that given in II§1.) 


Definition 4.4. Given languages L, L,, and Lz of type F let 


[y+ Lo = {wi + we: wi € Ly, we € Lo} and 
L* 


the subuniverse of (F*,-,1) generated by L. 


The set of regular languages of type F is the smallest collection of subsets of F* which contains 
the singleton languages {f}, f € FU {1}, and is closed under the set-theoretic operations, 
U,m,’, and the operations - and * defined above. 


To prove that the languages accepted by f.s.a.’s form precisely the class of regular lan- 
guages it is convenient to introduce partial algebras. 


Definition 4.5. A partial unary algebra of type F is a pair (A, F’) where F is a family 
of partially defined unary functions on A indexed by F, i.e., the domain and range of each 
function f are contained in A. 


Definition 4.6. A partial finite state acceptor (partial f.s.a.) A = (A, Fao, Ao) of type F 
has the same definition as an f.s.a. of type F, except that we only require that (A, F’) be a 
partial unary algebra of type F. Also the language accepted by A, £(A), is defined as in 4.3. 
(Note that for a given w € F*, w(a) might not be defined for some a € A.) 


Lemma 4.7. Every language accepted by a partial f.s.a. is accepted by some f.s.a. 


84. Finite State Acceptors 121 


PROOF. Given a partial f.s.a. A = (A, Fao, Ao) choose b ¢ A and let B = AU {b}. For 
feFandace AU {db}, if f(a) is not defined in A let f(a) = b. This gives an f.s.a. which 
accepts the same language as A. 


Definition 4.8. If (A, F’,ao, Ao) is a partial f.s.a. then, for a € A and w € F*, the range of 
w applied to a, written Rg(w, a), is the set 


{ fala); fn—1fn(@), Pony fi wets fn(a)} 
where w = fi--- fn; and it is {a} if w = 1. 
Lemma 4.9. The language accepted by any f.s.a. is regular. 


Proor. Let L be the language of the partial f.s.a. A = (A, Fao, Ao). We will prove the 
lemma by induction on |A]. First note that @ is a regular language as @ = {f}N{f}’ for any 
f © F. For the ground case suppose |A| = 1. If Ag = @ then L(A) = @, a regular language. 
If Ap = {ao} let 

G={f © F: f(ao) is defined}. 


Then : 
L(A) =S*=(U{Ft] ; 


fEG 


also a regular language. 

For the induction step assume that |A| > 1, and for any partial f.s.a. B = (B, F, bo, Bo) 
with |B| < |A| the language £(B) is regular. If Aj = @, then, as before, L(A) = @, a 
regular language. So assume Ag # @. The crux of the proof is to decompose any acceptable 
word into a product of words which one can visualize as giving a sequence of cycles when 
applied to ao, followed by a noncycle, mapping from ap to a member of Ag if ao ¢ Ao. Let 


C ={(fi, fo) € F x F: fiwfe(ao) = ao for some w € F", 
fo(ao) A ao, and Rg(w; fo(ao)) C A — {ao}} 


which we picture as in Figure 24. Now, for (fi, fo) € C let 


Chips = {w ae as fiw f2(ao) = a0, Rg(w; f2(ao)) GCAa {ao}}. 


Then Cy, f, is the language accepted by 
(A — {ao}, F, fo(ao), fr '(ao) — {ao}); 


hence, by the induction hypothesis, Cy, s, is regular. Let 


H={f €F: f(a) =a} U {1} 
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and 


D={f EF: flao) F ao}. 
For f € D let 


E; ={w € F* : wf(ao) € Ao, Rg(w, f(ao)) C A — {ao}}. 
We see that Ey is the language accepted by 
(A — {ao}, F, f(ao), Ao — {ao}); 
hence by the induction hypothesis, it is also regular. Let 
U Es: {f} if ao ¢ Ao 
fED 


(U Ey {f}) UO} if ag € Ao. 

fED 
Then * 
Lae: (1 U (5) Can) ) 


(fisf2)€C 


a regular language. 


ag 


Figure 24 


Definition 4.10. Given a type F and t ¢ F let the deletion homomorphism 
Or: (FU {t})* — F* 


be the homomorphism defined by 
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Lemma 4.11. Jf L is a language of type FU {t}, where t ¢ F, which is also the language 
accepted by some f.s.a., then 6,(L) is a language of type F which is the language accepted by 
some f.s.a. 


ProoF. Let A = (A, F'U {t}, ao, Ao) be an f.s.a. with L(A) = L. For w € F* define 
Sw = {W(ao) : WE (FU {t})*, 6 (w) = w} 


and let 
B={Sy,:we€ F*}. 


This is of course finite as A is finite. For f € F define 
f(a) SS ais 
This makes sense as S¥,, depends only on S,,, not on w. Next let 
bo — S1, 
and let 
Bo ={Swi SM Ao ~ Sh. 
Then 
(B, F, bo, Bo) accepts w 
iff w(S}) — Bo 
if S,NAp# SP 
iff W(ao) € Ao for some @ € 5, '(w) 


if we L for some w € 6, ‘(w) 


Theorem 4.12 (Kleene). Let L be a language. Then L is the language accepted by some 
fis.a. iff L ts regular. 


PrRooFr. We have already proved (=) in 4.9. For the converse we proceed by induction. If 
L = {f} then we can use the partial f.s.a. in Figure 25, where all functions not drawn are 
undefined, and Ap = {a}. If L = {1} use A = Ap = {ao} with all f’s undefined. 


if 


a «£—_——_———9 dy 
Figure 25 
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Next suppose L, is the language of (A, Fao, Ao) and Lz is the language of (B, F,, bo, Bo). 
Then 1, Lz is the language of (A x B, F, (ag, bo), Ao X Bo), where f((a,b)) is defined to 
be (f(a), f(b)); and Li is the language of (A, F’,a9, A — Ao) (we are assuming (A, F,, do, Ao) 
is an f.s.a.). Combining these we see by De Morgan’s law that L, U Lz is the language of a 
suitable f.s.a. 

To handle L,- Ly we first expand our type to FU {t}. Then mapping each member of Bo 
to the input of a copy of A as in Figure 26 we see that L, - {t}- Lz is the language of some 
f.s.a.; hence if we use 4.11 it follows that L, - D2 is the language of some f.s.a. 


Figure 26 


Similarly for Lj, let t map each element of Ap to ag as in Figure 27. Then (1, - {t})*+ Ly 
is the language of this partial f.s.a.; hence 


Ly = 6:[(L1 - {t})" £1 U {1} 


is the language of some f.s.a. This proves Kleene’s theorem. 


Figure 27 
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Another approach to characterizing languages accepted by f.s.a.’s of type F uses congru- 
ences on (F*,-, 1). 


Definition 4.13. Let 7 be the mapping from F* to T(x), the set of terms of type F over z, 
defined by r(w) = w(z). 


Lemma 4.14. The mapping T is an isomorphism between the monoid (F*,-,1) and the 
monoid (T'(x),0,x), where o is “composition.” 


PROOF. (Exercise.) 


Definition 4.15. For 0 € Con(3*,-, 1) let 
O(a) = {(wi(x), we(x)) : (wi, we) € OF. 
Lemma 4.16. The map @ +> 6(x) is a lattice isomorphism from the lattice of congruences 
of (F*,-,1) to the lattice of fully invariant congruences of T(x). (See 11814.) 
PROOF. Suppose 6 € Con(F*,-,1) and (wi,we) € 6. Then for u € F*, (uwi,uwe) € 6 


suffices to show that 6(x) is a congruence on T(x), and (wiu, wou) € O shows that 6(x) is 
fully invariant. The remaining details we leave to the reader. 


Lemma 4.17. Jf L is a language of type F accepted by some f.s.a., then there is a @ € 
Con(S*,-,1) such that 0 is of finite index (1.e., (F*,-,1)/0 is finite) and L° = L (see I1§6.16), 
i.e., L is a union of cosets of 0. 


PROOF. Choose A an f.s.a. of type F such that L(A) = L. Let F4(Z) be the free algebra 
freely generated by Z in the variety V((A, F’)). Let 


a: T(x) > F4(®) 
be the natural homomorphism defined by a(x) = Z, and let 
8: Fa(z) — (A, F) 


be the homomorphism defined by 3(%) = ao. Then, with 


peB—*(Ao) 
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hence 
Liiva 
As kera is a fully invariant congruence on T(x) we have kera = O(x) for some 0 € 


Con(F*,-, 1). Thus 


hence 
L=L’. 


As ker a is of finite index, it follows that @ is also of finite index. 


Theorem 4.18 (Myhill). Let L be a language of type ¥. Then L is the language of some 
f.s.a. iff there is a @ € Con(F*,-,1) of finite index such that L® = L. 


PROOF. (=) This was handled in 4.17. 
(<=) Suppose @ is a congruence of finite index on F* such that L° = L. Let 


A= {w/0: we F*} 
f(w/0) = fw/@ for fEeF 

ag = 1/0 

Ao = {w/6: w € L}. 


Then 


(A, Fao, Ao) accepts w 
iff w(1/0) € Ao 
iff w/0€ Ag 
iff w/?=u/6 for some u € L 
iff weé L. 


Definition 4.19. Given a language L of type F define the binary relation =; on F* by 

Wi =~ wo iff (uw € LS uw € L for u,v € F*). 
Lemma 4.20. Jf we are given L, a language of type F, then =, is the largest congruence 0 
on (F*,-,1) such that L® = L. 


PRoor. Suppose L’ = L. Then for (w;,w2) € 6 and u,v € F*, (uwyv, uwyv) € O; hence 
uwyv € L & uw € Las uwyv/0 = uwav/8 and L = U,,<, w/0. Thus @ € =; . 
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Next =, is easily seen to be an equivalence relation on F*. If wy =p we and wy, =p we 
then for u,v € F*, 


uwywyu € L 
iff wwywov € L 
iff wwoweu € L; 
hence 
wi =p Wetd2, 
so =, is indeed a congruence on (F*,-, 1). 


If now w € L and w =, w then 


l-w-leLel-w-1leLl 


implies w € L; hence w/ =, C L. Thus L=" = L. 


Definition 4.21. If we are given a language L of type F, then the syntactic monoid Mz, of 
L is defined by 
M, = So Ny tl Ae 


Theorem 4.22. A language L is accepted by some f.s.a. iff Mz, is finite. 


PROOF. Just combine 4.18 and 4.20. 
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Chapter IV 


Starting from Boolean Algebras ... 


Boolean algebras, essentially introduced by Boole in the 1850’s to codify the laws of thought, 
have been a popular topic of research since then. A major breakthrough was the duality 
between Boolean algebras and Boolean spaces discovered by Stone in the 1930’s. Stone also 
proved that Boolean algebras and Boolean rings are essentially the same in that one can con- 
vert via terms from one to the other. Following Stone’s papers numerous results appeared 
which generalized or used his results to obtain structure theorems—these include the work of 
Montgomery and McCoy (rings), Rosenbloom (Post algebras), Arens and Kaplansky (rings), 
Foster (Boolean powers), Foster and Pixley (various notions of primality), Dauns and Hof- 
mann (biregular rings), Pierce (rings), Comer (cylindric algebras and general algebras), and 
Bulman-Fleming, Keimel, and Werner (discriminator varieties). 

Since every Boolean algebra can be represented as a field of sets, the class of Boolean 
algebras is sometimes regarded as being rather uncomplicated. However, when one starts 
to look at basic questions concerning decidability, rigidity, direct products, etc., they are 
associated with some of the most challenging results. Our major goal in this chapter will be 
representation theorems based on Boolean algebras, with some fascinating digressions. 


81. Boolean Algebras 


Let us repeat our definition from II§1. 


Definition 1.1. A Boolean algebra is an algebra (B, V,/A,’,0, 1) with two binary operations, 
one unary operation (called complementation), and two nullary operations which satisfies: 
Bl: (B,V,A) is a distributive lattice 

B2: cA0#0, cV1e1 

Boerne 3 0..9 Ve elk 


Thus Boolean algebras form an equational class, and hence a variety. Some useful prop- 
erties of Boolean algebras follow. 
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Lemma 1.2. Let B be a Boolean algebra. Then B satisfies 
B4: aAb=0 andaVb=1 implya=0U 

Bd: (a2')' eax 

B6: (a Vy) ealAy’(xAy)' x2’ Vy’ (DeMorgan’s Laws). 


PRooF. If 
a\b=0 


then 


=1A(a' vb) 
=a' Vb; 
hence a’ > b, and if 
aVb=1 


then 


Q 
| 
Q 
= 
—— 
Q 
< 
ja 
a 


= 0V(a' Ab) 

=a Nb, 
Thus a’ < b; hence 

b=a' 
This proves B4. 
Now 
a Na=0 and. a’ Va =1: 

hence 


a= (ay 
by B4, so B5 is established. Finally 
(cVy)V (a Ay!) & eV [yv (2 Ay’)] 
eaVvi[yve)AyVvy’)] 
saVyVa' 


el 
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and 
(xVy) A(z Ay) = [zA(@’ Ay) V yA(a'Ay)] 
ZOVO 
= 0. 
Thus by B4 


ae Ay ss (eV yy. 
Similarly (interchanging V and A, 0 and 1), we establish 

ae Vay RS (eA yy 
Perhaps the best known Boolean algebras are the following. 


Definition 1.3. Let X be a set. The Boolean algebra of subsets of X, Su(X), has 
as its universe Su(X) and as operations U,N,’,@,X. The Boolean algebra 2 is given by 
(2,V,A,‘,0,1) where (2, V, A) is a two element lattice with 0 < 1, and where 0’ = 1,1’ = 0; 
also 1 = ({@},V,A,’, Z, ). 

It is an easy exercise to see that if |X| = 1 then Su(X) = 2; and Su(@) = 1. 
Lemma 1.4. Let X be a set. Then Su(X) = 2*. 
PRoor. Let a: Su(X) — 2* be such that 

a(Y)(z4)=1 iff rey. 


Then a is a bijection, and both a and a~! are order-preserving maps between (Su(X), C) 


and (2*, <); hence we have a lattice isomorphism. Also for Y C X 


a(Y"\(x7)=1 iff c¢Y 
it .@(Y) (a= 0: 


hence 


sO 


As 


we have an isomorphism. 
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Definition 1.5. If B is a Boolean algebra and a € B, let Ba be the algebra 


((0, al, V;, AS rl 0, a) 


where [0,a] is the interval {c € B: 0 < a < a},V and A are the same as in B except 
restricted to [0,a], and b* is defined to be aA VU’. 


Lemma 1.6. Jf B is a Boolean algebra anda € B then Bfa 1s also a Boolean algebra. 


PROOF. Clearly ([0,a],V,/) is a distributive lattice, as it is a sublattice of (B,V, A). For 
b € [0, a] we have 


bv 0, bV a= a, 
bAB =bA(aND) 
= 0, 
bVb* =bV (aA0) 
=(aAb)V (aNt') 
=aA(bVJU) 
al 


= a. 


Thus Bfa is a Boolean algebra. 


Lemma 1.7. Jf B is a Boolean algebra anda € B then the map 


:B—- Bla 


defined by 


aa(b) =aAb 


is a surjective homomorphism from B to Bia. 
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PrROoF. If b,c € B then 


ag(bVc) =aA(bVc) 
=(aAb)V (adc) 
= aq(b) V aa(c), 

ag(bAc) =aA(bAc) 
= (aAb)A (adc) 
= aq(b) A aa(c), 

(0) =aA0 

=(aAa)V(aAJ0) 
=a/(a' V0’) 
SGI a Nb 
= (aq(b))*, 

ag Oy 0" vanid: ob) 


Thus a, is indeed a homomorphism. 

Theorem 1.8. Jf B is a Boolean algebra anda € B, then 
B=Bfax Bld’. 

PROOF. Let 


a:B— Blax Bta' 


be defined by 
a(b) = (aa(b), dar(d)). 


It is easily seen that a is a homomorphism, and for 
(b,c) € Blax Bfa’ 


we have 


a(bVc) = (aA(bVc),a’ A(bVc)) 


= (b,c) 
aN (bVc)=(aAb)V (adc) 
= b7'0 
= b, 
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etc. Thus a is surjective. Now if 
a(b) = a(e) 


for any b,c € B then 
a\b=aAc and a’Ab=a' Ac 


SO 
(aA b) V (a Ab) = (adc) V (a' Ac); 
hence 
(aVa')Ab=(aVa')Ac, 
and thus 


b=c. 


This guarantees that a is the desired isomorphism. 


Corollary 1.9 (Stone). 2 is, up to isomorphism, the only directly indecomposable Boolean 
algebra which is nontrivial. 


ProoF. If B is a Boolean algebra and |B| > 2, let ae B, 0<a<1. Then0O<dad <1, 
and hence both Bf a and B/ a’ are nontrivial. From 1.8 it follows that B is not directly 
indecomposable. 


Corollary 1.10 (Stone). Every finite Boolean algebra is isomorphic to the Boolean algebra 
of all subsets of some finite set X. 


PROOF. Every finite Boolean algebra B is isomorphic to a direct product of directly inde- 
composable Boolean algebras by 1187.10; hence B = 2” for some finite n. Now 1.4 applies. 


Definition 1.11. A field of subsets of the set X is a subalgebra of Su(X), i-e., a family of 
subsets of X closed under union, intersection, and complementation and containing @ and 
X, with the operations of Su(X). 


Corollary 1.12. Every Boolean algebra is isomorphic to a subdirect power of 2; hence 
(Stone) every Boolean algebra is isomorphic to a field of sets. 


PROOF. The only nontrivial subdirectly irreducible Boolean algebra is 2, in view of 1.9. 
Thus Birkhoff’s theorem guarantees that for every Boolean algebra B there is an X and a 
subdirect embedding a : B — 2*; hence by 1.4 there is an embedding 3 : B — Su(X). 


Definition 1.13. Let BA be the class of Boolean algebras. 
The next result is immediate from 1.12. 


Corollary 1.14. BA =V(2) = ISP(S) = IPs(S), where S = {1, 2}. 
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EXERCISES §1 


il 


10. 


10m 


A subset J of a set I is a cofinite subset of I if I — J is finite. Show that the collection 
of finite and cofinite subsets of J form a subuniverse of Su(/). 


If B, and By are two finite Boolean algebras with |B,| = |B2|, show B; = By. 


Let B be a Boolean algebra. An element b € B is called an atom of B if b covers 0 (see 
1§1). Show that an isomorphism between two Boolean algebras maps atoms to atoms. 


Show that an infinite free Boolean algebra is atomless (i.e., has no atoms). 


Show that any two denumerable atomless Boolean algebras are isomorphic. [Hint: 
Let Bo, Bi be two such Boolean algebras. Given an isomorphism a : Bj — Bj, Bi a 
finite subalgebra of B;, and Bj) < Bj < Bo with Bj finite, show there is a BY with 
B{ < BY < B, and an isomorphism 3 : Bj — Bi extending a. Iterate this procedure, 
alternately choosing the domain from Bo, then from Bj.] 


If B is a (nontrivial) finite Boolean algebra, show that the subalgebra of B®” generated 
by the projection maps 7 : B? — B, where ™(f) = f(b), has cardinality ore, 


. Let F(n) denote the free Boolean algebra freely generated by {%1,...,%,}. Show F(n) = 


2?" (Hint: Use Exercise 6 above and II§11 Exercise 5.] 


. If Bis a Boolean algebra and a,b € B with aA b= 0 are such that Bla = B[b, show 


that there is an automorphism a of B such that a(a) = b and a(b) =a. 


. If A is an algebra such that Con A is a distributive, show that the factor congruences 


on A form a Boolean lattice which is a sublattice of Con A. 


Let B be a subalgebra of Su(X). Show that B = {Y CX: (YNZ')U(ZNY’) is finite 
for some Z € B} is a subuniverse of Su(X), and B contains all the atoms of Su(X). 


Given a cardinal k > w and a set X show that {Y C X:|Y|<kKor|X—-Y|<khisa 
subuniverse of Su(X). 


The study of cylindric algebras (see II§1) has parallels with the study of Boolean algebras. 
Let CA, denote the class of cylindric algebras of dimension n, and let c(x) be the term 
co(e1(..- (Cn—1(@))...)). We will characterize the directly indecomposable members of C'Ay, 
below. 
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Show C'A, satisfies the following: 


For A € CA, and a € A with c(a) = a define Afa to be the algebra ((0,a],V,A,*, 
Co,-++;Cn—1,0, a, doo A a,.--,;dn—1sn-1 Aa), where the operations V,/A,Co,...,Cn—1 are 
the restrictions of the corresponding operations of A to [0,a], and 2* =a 2’. Show 
Alfa €CA,, and the map a: A > A/a defined by a(b) = b Aa is a surjective 
homomorphism from A to Afa. 


If A € CA,,a € A, show that c(a) = a implies c(a’) = a’. Hence show that if c(a) =a 
then the natural map from A to A fax A fa’ is an isomorphism. Conclude that 
A € CA, is directly indecomposable iff it satisfies a 4 0 — c(a) = 1 fora € A. 


A member of CA, is called a monadic algebra. Show that the following construction 
describes all finite monadic algebras. Given finite Boolean algebras B,,..., By, define 
co on each B; by co(0) = 0 and co(a) = 1 if a £ 0, and let doo = 1. Call the resulting 
monadic algebras B;. Now form the product Bj x --- x Bj. 


Boolean Rings 


The observation that Boolean algebras could be regarded as rings is due to Stone. 


Definition 2.1. A ring R = (R,+,-,—,0,1) is Boolean if R satisfies 


v2 OL. 


Lemma 2.2. Jf R is a Boolean ring, then R. satisfies 


xtaer0 and «c-yry-s. 
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PRooF. Let a,b € R. Then 
(ata) =a+a 


implies 
C+ete +a Sata: 
hence 
aA+ra+ra+a=ara, 
SO 
ata=0. 
Thus 
RE«z+270. 
Now 
(a+b)? =a+5, 
SO 
avta-b+b-a+¥=a++b; 
hence 
at+a-b+b-a+b=a+b, 
yielding 
a-b+b-a=0. 
As 
a-b+a-b=0 
this says 
a-b+a-b=a-6+b-a, 
sO 
a-b=b-a. 
Thus 


REa«-yy-h. 


Theorem 2.3 (Stone). (a) Let B = (B,V,A,',0,1) be a Boolean algebra. Define B® to be 
the algebra (B,+,-,—,0,1), where 


at+b=(aAJb')V (a' Ab) 
a-b=aAb 


—a= a. 


Then B® is a Boolean ring. 
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(b) Let R = (R,+,-,—,0,1) be a Boolean ring. Define R® to be the algebra (R, V,/A,',0, 1) 
where 
aVb=a+b+a-b 
a\b=a-b 
a’ =1+a. 


Then R® is a Boolean algebra. 
(c) Given B and R as above we have B®® = B, R®® = R. 


PROOF. (a) Let a,b,c € B. Then 


(i) a+0=(aA0') V(a' AO) 
=aAl 

(ii) a+b=(aAd')V (a Ab) 
= (bAa') V(b’ Aa) 


(iii) ata=(aAd)V(a' Aa) 


ae, 


(iv) a+(b+c)= [aA (b+0c)]V [aA (b+ 0)] 
={aN [(bACc)V(U Ac} V {a' A [(BAC) V (0 Ac)}} 
={an[UVc)A(bVE)}V{(d AbAC)V (a Ab Ac)} 
= {aA [WU Ac)V(cAbB]}EV {(a AAC) V (a AU Ac)} 
=(aANUAC)V(adNbAc)V(a AbAC)V (a Ab Ac) 
=(aNbAc)V(aAUAC)V(bAC Aad) V(cAd AO). 


The value of this last expression does not change if we permute a,b and c in any manner; 
hence c+ (a+b) =a+(b+c), so by (ii) (a+b) +e=a+(b+c). 


(vi) a-(b-c) =a (bAc) 
=(aAb)Ac 
=(a-b)-c 


(vii) a-(b+c) =a |[(bAC)V(U Ac)] 
=(aNbAc)V(aAv' Ac) 
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and 
(a-b) + (a-c) = [(aAb) A (adc) V [(aAb) A (aAc)] 
= [(aAb)A(a Ve) V (a VB) A (adc) 
=(aNbAc)V(!Aadc) 


a: (b+c) =(a-b)+(a-c). 


(viii) a-a=aAa 
=a. 


Thus B® is a Boolean ring. 
(b) Let a,b,c € R. Then 


(i) aVb=a+b-+a-b 
=b+a+tb-a 
=bVa 
(ii) aAb=a-b 
=b-a 
=bAa 
(iii) aV(bVc)=a+(bVc)+a-(bVc) 
=a+(b+c+b-c)+a-(b+c+5-c) 
a a? Seba! ag lO Oh a? 


=a 


The value of this last expression does not change if we permute a,b and c, so 
aV (bVc)=cV(aVb); 


hence by (i) above 
aV(bVc)=(aVb)Ve. 


(iv) aA(bAc) =a: (b-c) 
=(a-b)-c 
=(aAb)Ac 

(v) aVa=a+a+a-a 

=O+a 


=a 
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(vi) a\a=a-a 


=a 


(vii) aV(aAb) =a+(aAb)+a-(aAb) 
=a+a-b+a-(a-b) 
=at+a-b+a-b 
=0 

(viii) aA(aVb)=a-(a+b-+a-b) 
=a?+a-b+a*-b 
=ata-b+a-b 
=6 

(ix) aA(bVc)=a-(b+ct+b-c) 
=a-b+a-ct+a-b-c 
=a-b+a-cta-b-a-c 
=(aAb)V (adc) 

(x) aA0=a-0 


=0 
(xi) aVl=a+l+a-1 
= 
(xii) aAa’ =a-(1+a) 
=a+@ 
= Gara 
=0 


(xiii) a@Va'=a+(1+a)+a-(1+a) 
=at+l+at+a+@ 
le 


Thus R® is a Boolean algebra. 


(c) Suppose B = (B,V,A,’,0,1) is a Boolean algebra and a,b € B. Then with B® = 
(B, +, ae 0; 1) 


(i) a-b=aAb 
(ii) l+a=(lAd)V(l'Aa) =a 
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(iii) a+b+a-b=a+(b+a-b) 
=a+b-(1+a) 
=a+t+b-a' 
=|aA(bAa’)']V [a A(bAa@)| 
= [aA (b' Va)| V [a’ Ad] 
=aV (a' Ab) 
= (aVa') A(aVb) 
=aV b. 


Thus B®? = B; 
Next suppose R = (R,+,-,—,0,1) is a Boolean ring. Then with R® = (R,V,A,’,0,1) 
and a,b € R, 


(i) (aAvW)V (a Ab) = [a-(1+b)] + [A +a)-b] 4 [a- (1+b)-(1 +a) -) 
=|at+a-b]+[b+a-b] +0 
=a+b 


(ii) aAb=a-b. 


Thus R®® = R. 


The reader can verify that ® has nice properties with respect to H, S, and P; for example, 
if B,;, Bz € BA, then 


(i) If a: By — By is a homomorphism then a : BP — B? is a homomorphism between 
Boolean rings. 


(iii) If B; € BA for i € J then (T],<7Bi)*° = Tie, BE. 
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EXERCISES §2 


1. If A is a Boolean algebra [Boolean ring] and Ao is a subalgebra of A, show A® is a 
subalgebra of A®. 
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If A;, A> are Boolean algebras [Boolean rings] and a : A; — Ag is a homomorphism, 
then a is also a homomorphism from A? to A®. 


If (A,)icy is an indexed family of Boolean algebras [Boolean rings], then (I] A,)° = 


Tics Av’. 


If we are given an arbitrary ring R, then an element a € R is a central idempotent 
if a-b = b-a for all b € R, and a? = a. If R is a ring with identity, show that 
the central idempotents of R form a Boolean algebra using the operations: a V b = 
at+b—a-b, aAb=a-b, anda =1-—a. 


tel 


If 6 is a congruence on a ring R with identity, show that @ is a factor congruence 
iff 0/0 is a principal ideal of R generated by a central idempotent. Hence the factor 
congruences on R form a sublattice of Con R which is a Boolean latttice. 


An ordered basis (Mostowski/Tarski) for a Boolean algebra B is a subset A of B which 
is a chain under the ordering of B, 0 ¢ A, and every member of B can be expressed in the 
form ay +---+4dy, a; € A. 


6. 
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If A is an ordered basis of B, show that 1 € A, A is a basis for the vector space (B, +) 
over the two-element field, and each nonzero element of B can be uniquely expressed 
in the form a; +---+a, with a; € A, a, < dg < +--+ < dp. 


Show that every countable Boolean algebra has an ordered basis. 


Filters and Ideals 


Since congruences on rings are associated with ideals, it follows that the same must hold 
for Boolean rings. The translation of Boolean rings to Boolean algebras, namely R +> R®, 
gives rise to ideals of Boolean algebras. The image of an ideal under ’ gives a filter. 


Definition 3.1. Let B = (B,V,/,',0,1) be a Boolean algebra. A subset J of B is called 
an ideal of B if 


(i) OE TF 
(ii) abe ITSavbel 


(iii) a€elandb<asbdel. 


A subset F’ of B is called a filter of B if 


(i) le F 


(ii) abe F>aANbeF 
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Theorem 3.2. Let B= (B,V,/,',0,1) be a Boolean algebra. Then I is an ideal of B iff I 
is an ideal of B®. 

ProoF. Recall that J is an ideal of a ring B® iff 


Oe 7, 
abeIlsa+bel 


as —b = 6, and 
a€lbeR=a-bel. 


So suppose J is an ideal of B. Then 


OE T, 
and if a,b € I then 
ab! <a, 
a’ Ab<b, 
=e) 
aNv'=.a Abe T; 
hence 


at+tb=(aAv)V(a' Ab) El. 


Now ifa€ J and be€ B then 
aNb<a, 


sO 
a-b=aAbel. 


Thus J is an ideal of B®. 
Next suppose J is an ideal of B®. Then 0 € J, and if a,b € I then 
a-beT; 
hence 
aVb=a+t+b-+a-bel. 


Ifa € J and b<athen 
b=aAb=a-bel, 


so I is an ideal of B. 
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Definition 3.3. If X C B, where B is a Boolean algebra, let 


X'= {a': aE X}. 


The next result shows that ideals and filters come in pairs. 


Lemma 3.4. Given a Boolean algebra B, then 
(a) For I CB, I is an ideal iff I' is a filter, 
(b) For F CB, F is a filter iff F’ is an ideal. 


PROOF. First 
OEel if 1=0er. 


If a,b € I then 
aVbel iff (avb/=aAVeETr. 


For a € I we have b < a iff a’ < 0’; hence b € J iff b! € I’. This proves (a), and (b) is handled 
similarly. 


The following is now an easy consequence of results about rings, but we will give a direct 
proof. 


Theorem 3.5. Let B be a Boolean algebra. If 6 is a binary relation on B, then @ is a 
congruence on B iff 0/0 is an ideal, and for a,b € B we have 


(a,b)e€O iff a+beE0/8. 


PROOF. (=) Suppose @ is a congruence on B. Then 


0€ 0/6, 
and if a,b € 0/@ then 
a,0) € 8, 
(b, 0) € 9, 
so 
(aV b,0V0) € 84, 
1.€., 


aVbeo/é. 


Now if a € 0/6 and b < a then 
(a,0) € 6 
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so 
(aA b,0A b) € 8, 
ea 
(b, 0) € 8; 
hence 
bE 0/8. 
This shows that 0/0 is an ideal. Now 
(a,b) € 6 
implies 
(a+ b,b+ 6) € 8, 
Les 
(a+6,0) €9, 
so 
a+be0/6. 
Conversely, 
a+be0/0 
implies 
(a + 6,0) € 8, 
so 
((a+ 6) +6,0+5) € 8, 
thus 


(a,b) € 8. 


(<) For this direction, first note that @ is an equivalence relation on B as 


(a,a) € 9, 
since 

at+ta=Q0, 
for a € B; if 

(a,b) € 6 
then 

(b,a) € 6 
as 


a+b=b+a; 
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and if 
(a,b) €@, 
(b,c) € 6 
then 
at+c=(a+b)+(b+c) € 0/9; 
hence 


(a,c) € 6. 
Next to show that 6 is compatible with the operations of B, let a1, a2, b,,b2 € B with 


(a1, a2) E 0, 
(by, bo) € 8. 
Then 
(ay /\ b;) + (a2 /\ by) — (ay , by) + (a2 i by) 
= (ay . by) + [(a1 2 bz) + (ay ° by) | + (a2 : bz) 
= a,°- (by + bg) + (ay + a2) »bg € 0/8, 
sO 
(ay ix by, ag ix bs) E @. 
Next 


(az V by) + (a2 V bg) = (ay + by + ay - by) + (Ag + be + ae - be) 
= (a, a) (by oy) (a1 A by + a2 A bg) € 0/0 


as each of the three summands belongs to 0/6, so 
(ay V by, ag V by) EO. 
From 
a, + a2 € 0/0 


it follows that 
(1+ a1) + (1+ a2) € 0/6, 


so 
(a,a5) € 0. 


This suffices to show that 6 is a congruence. 
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Definition 3.6. If J is an ideal of a Boolean algebra B, let B/J denote the quotient algebra 
B/0, where 
(a,b) e@ iff a+bel. 


Let b/I denote the equivalence class b/@ for b € B. If F is a filter of B let B/F denote B/F" 
and let b/F' denote b/F” (see 3.3). 


Since we have established a correspondence between ideals, filters, and congruences of 
Boolean algebras, it is natural to look at the corresponding lattices. 


Lemma 3.7. The set of ideals and the set of filters of a Boolean algebra are closed under 
arbitrary intersection. 


PROOF. (Exercise). 


Definition 3.8. Given a Boolean algebra B and aset X C B let I(X) denote the least ideal 
containing X, called the ideal generated by X, and let F'(X ) denote the least filter containing 
X, called the filter generated by X. 


Lemma 3.9. For B a Boolean algebra and X C B, we have 
(a) 1(X) = {bE B:b<b, V---V by for some by,..., bn € X}U {0} 
(b) F(X) = {bE B:b>bA---A by for some bi,...,6n € X}U {1}. 


ProoF. For (a) note that 
je TX), 


and for b),...,b, € X we must have 


dV es VOSTOK), 


sO 
I(X)D> {bE B:b<b,V---Vb, for some bj,...,b, € X}U {0}. 


All we need to do is show that the latter set is an ideal as it certainly contains X, and for 
this it suffices to show that it is closed under join. If 


b<b V--+V by, 
CSG V+ V Gm, 


with by,...,6n,C1,---,Cm © X then 


bVE<b V+ Vb, VG Vit eV Gm 


so I(X) is as described. The discussion of F(X) parallels the above. 
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Lemma 3.10. Let B be a Boolean algebra. 
(a) The set of ideals of B forms a distributive lattice (under C) where, for ideals [y, Io, 


Lyf foe= Perils; 
LVIg={a€ Bia<aV ap for some ay € I, a2 € Ip}. 


(b) The set of filters of B forms a distributive lattice (under C) where, for filters F\, Fy, 


PAF, =KhOf, 
FLV Fy = {a€ B:a>a,A az for some ay € Fi, a2 € Fo}. 


(c) Both of these lattices are isomorphic to Con B. 
PROOF. From 3.5 it is evident that the mapping 
0 ++ 0/6 
from congruences on B to ideals of B is a bijection such that 
6,260 iff 0/0; C 0/6. 


Thus the ideals form a lattice isomorphic to Con B. The calculations given for A and V 
follow from 3.7 and 3.9. The filters are handled similarly. The distributivity of these lattices 
follows from the fact that Boolean algebras form a congruence-distributive variety, see I1§12, 
or one can verify this directly. 


A remarkable role will be played in this text by maximal filters, the so-called ultrafilters. 


Definition 3.11. A filter F of a Boolean algebra B is an ultrafilter if F is maximal with 
respect to the property that 0 ¢ F. A mazimal ideal of B is an ideal which is maximal 
with respect to the property that 1 ¢ J. (Thus only non-trivial Boolean algebras can have 
ultrafilters or maximal ideals. ) 


In view of 3.4, F' is an ultrafilter of B iff F’ is a maximal ideal of B, and J is a maximal 
ideal of B iff J’ is an ultrafilter. The following simple criterion is most useful. 


Theorem 3.12. Let F be a filter [I be an ideal] of B. Then F is an ultrafilter [I is a 
maximal ideal] of B iff for any a € B, exactly one of a,a' belongs to F {belongs to I]. 


PROOF. Suppose F' is a filter of B. 
(=) If F is an ultrafilter then 
B/F~2 
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by 1.9 as B/F is simple by II§8.9. Let 
v:B-B/F 


be the natural homomorphism. For a € B, 


Halli or ve) =1/F, 


as B/F & 2; hence 
acF or aveF. 


If we are given a € B then exactly one of a,a’ is in F' as 
aha =0€F. 
(<) For a € B suppose exactly one of a,a’ € F. Then if F, is a filter of B with 
FCF, and FFF, 
let a € F, — F. Asa’ € F we have 
O=adad €F; 


hence fF; = B. Thus F is an ultrafilter. 
The ideals are handled in the same manner. 


Corollary 3.13. Let B be a Boolean algebra. 

(a) Let F be a filter of B. Then F is an ultrafilter of B iff 0 ¢ F and fora,b € B, avbe F 
fae F orbe F. 

(b) (Stone) Let I be an ideal of B. Then I is a maximal ideal of B iff 1 ¢ I and for 
abe B, aNbeliffael orbel. 


PROOF. We will prove the case of filters. 
(=) Suppose F is an ultrafilter with 


aVoeF. 


As 
(aVb)A(av AU) =0€¢F 


we have 
a Ab! ¢ F; 
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hence 
agéF or UGF. 


By 3.12 either 
ac€éF or beEF. 


(<) Since 1 € F, given a € B we have 
aVa eF; 


hence 
ac€F or @eF. 


Both a,a’ cannot belong to F’ as 
aha =0¢€¢F. 


Definition 3.14. An ideal J of a Boolean algebra is called a prime ideal if 1 ¢ I and 
aN\bel implies a€lorbel. 
Thus we have just seen that the prime ideals of a Boolean algebra are precisely the 
maximal ideals. 


Theorem 3.15. Let B be a Boolean algebra. 
(a) (Stone) If a € B — {0}, then there is a prime ideal I such that a ¢ I. 
(b) Ifa € B— {1}, then there is an ultrafilter U of B with a ¢ U. 


PROooF. (a) If a € B — {0}, let 


a:B—3 2? 
be any subdirect embedding of B into 2” for some J (see 1.12). Then 
a(a) # a(0), 
so for some j € J we have 
(1; 0 a)(a) F (mj 0 @)(0). 


As 
mj0oa:B2 


is onto it follows that 
6 = ker(7; 0 a) 


is a maximal congruence on B; hence 


1 =0/6 


§3. Filters and Ideals 151 


is a maximal ideal, thus a prime ideal, and a ¢ I. 
(b) is handled similarly. 


Lemma 3.16. Let B, and By be Boolean algebras and suppose 
a: B,; > Bo 
is a homomorphism. If U is an ultrafilter of Bz, then a~1(U) is an ultrafilter of By. 
ProoF. Let U be an ultrafilter of By and @ the natural map from By to B2/U. Then 
a“'(U) = (B0a)*(), 


hence a~1(U) is an ultrafilter of B, (as the ultrafilters of B, are just the preimages of 1 
under homomorphisms to 2). 


Theorem 3.17. Let B be a Boolean algebra. 

(a) If F is a filter of B anda € B-— F, then there is an ultrafilter U with F C U and 
a ZU. 

(b) (Stone) If I is an ideal of B anda € B —I, then there is a maximal ideal M with 
ICM andag M. 


ProoF. For (a) choose an ultrafilter U* of B/F by 3.15 with 
a/F €éU*. 


Then let U be the inverse image of U* under the canonical map from B to B/F. (b) is 
handled similarly. 


EXERCISES 83 


1. If B is a Boolean algebra and a,b, c,d € B, show that 


(a,b) € O(c, d) S@at+b<c+d. 


2. If B is a Boolean algebra, show that the mapping a from B to the lattice of ideals of 
B defined by a(b) = I(b) embeds the Boolean lattice (B, V,/) into the lattice of ideals 
of B. 


3. If U is an ultrafilter of a Boolean algebra B, show that /\ U exists, and is an atom 6 or 
equals 0. In the former case show U = F'(b). (Such an ultrafilter is called a principal 
ultrafilter. ) 


4. If B is the Boolean algebra of finite and cofinite subsets of an infinite set J, show that 
there is exactly one nonprincipal ultrafilter of B. 
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5. If H = (A,V,/A,—,0,1) is a Heyting algebra, a filter of H is a nonempty subset F’ of 
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H such that (i) a,b € F > a Ab e€ F and (ii) ae Fy a< b> 0 € F. Show (1) if 
6 € Con H then 1/6 is a filter, and (a,b) € @ iff (a — b) A (b > a) € 1/8, and (2) if 
F is a filter of H then 6 = {(a,b) € H?: (a + b) A(b a) € F} is a congruence and 
P= 1/0. 


If A = (A,V,A,‘,0,---5Cn—1, 0, 1, doo,..-,;@n—in—1) is a cylindric algebra, a cylindric 
ideal of A is a subset I of A which is an ideal of the Boolean algebra (A, V, A,’,0, 1) and 
is such that c(a) € I whenever a € J. Using the exercises of §1 show (1) if @€ Con A 
then 0/6 is a cylindric ideal and (a,b) € 6 iff a+b € 0/6, and (2) if J is a cylindric 
ideal of A then 6 = {(a,b) € A? : a+b € J} is a congruence on A with I = 0/06. 


Show that a finite-dimensional cylindric algebra A is subdirectly irreducible iff a € A 
and a 4 0 imply c(a) = 1. 


Stone Duality 


We will refer to the duality Stone established between Boolean algebras and certain topo- 
logical spaces as Stone duality. In the following when we speak of a “clopen” set, we will 
mean of course a closed and open set. 


Definition 4.1. A topological space is a Boolean space if it (i) is Hausdorff, (ii) is compact, 
and (iii) has a basis of clopen subsets. 


Definition 4.2. Let B be a Boolean algebra. Define B* to be the topological space whose 
underlying set is the collection B* of ultrafilters of B, and whose topology has a subbasis 
consisting of all sets of the form 


Nz = {U € B*:aeJU}, 


fora € B. 


Lemma 4.3. Jf B is a Boolean algebra and a,b € B then 


and 


Na U Np = Navo; 
Na M7 No — None; 
Ny = (N,y. 


Thus in particular the N,’s form a basis for the topology of B*. 
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PROOF. 
UEN,UN, iff acUorbeU 
if aVbeU 
iff U © Navp. 
Thus 
N,U Ns = Navo- 


The other two identities can be derived similarly. 


Lemma 4.4. Let X be a topological space. Then the clopen subsets of X form a subuniverse 


of Su(X). 


PROOF. (Exercise.) 


Definition 4.5. If X is a topological space, let X* be the subalgebra of Su(X) with universe 
the collection of clopen subsets of X. 


Theorem 4.6 (Stone). (a) Let B be a Boolean algebra. Then B* is a Boolean space, and B 
is isomorphic to B** under the mapping 


at> Na. 


(b) Let X be a Boolean space. Then X* is a Boolean algebra, and X is homeomorphic 
to X** under the mapping 
rre{NexX*:xreE N}. 


ProoF. (a) To show that B* is compact let (Na)acz be a basic open cover of B*, where 
J C B. Now suppose no finite subset of J has 1 as its join in B. Then J is contained in a 
maximal ideal M, and 


U=M' 
is an ultrafilter with 
UNJ=2@. 
But then 
U¢N, 


for a € J, which is impossible. Hence for some finite subset Jo of J we have 


VV Jo = 1. 
VV Jo EU 
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for every ultrafilter U we must have 
UEN, 


for some a € Jp by 3.13, so (Na)acy is a cover of B*. Thus B* is compact. It clearly has a 
basis of clopen sets as each N, is clopen since 


Nan Na = 2, 
N,U Nw = B*. 
Now if 
U, 4 U2 
in B* let 
ae U; — Us. 
Then 
U, E Na, 
U2 € Nats 


so B* is Hausdorff. Thus B* is a Boolean space. 
The mapping a +> N, is clearly a homomorphism from B to B™ in view of 4.3. Ifa,b € B 
and a # b then 
(aV b) A(aAb)' £0, 


so by 3.15(a) there is a prime ideal J such that 
(aV b) A (aAb)' € I, 
so there is an ultrafilter U(= I’) such that 


(aVb) A(aAb)' EU. 


But then 

(aAb)' EU 
so 

aNb€¢U; 
hence 

agU or b€¢U; 

but as 

aVoEeU 
we have 


acU or VEU 


84. Stone Duality 155 


so exactly one of a,b is in U; hence 
Nz # Np. 
Thus the mapping is injective. If now N is any clopen subset of B* then, being open, N is a 
union of basic open subsets N,, and being a closed subset of a compact space, N is compact. 
Thus JN is a finite union of basic open sets, so N is equal to some N,, by 4.3. 
Thus B = B*™ under the above mapping. 
(b) X* is a Boolean algebra by 4.4. Let 


a:X — X*™* 


be the mapping 
a(x) ={NeEX*:reE N}. 
(Note that a(x) is indeed an ultrafilter of X*). If 
x.yEeEX and «rH#y 


then 

a(x) # aly) 
as X is Hausdorff and has a basis of clopen subsets. If U is an ultrafilter of X* then U is 
a family of closed subsets of X with the finite intersection property, so as X is compact we 
must have 


()U #4 @. 
It easily follows that for x € (| U, 

U Ca(z); 
thus 

Uae) 


by the maximality of U. Thus a is a bijection. 
A clopen subset of X** looks like 


{UE x™:NEU} 
for N € X*,i.e., for N a clopen subset of X. Now 


a(N) = {U € X™: a(x) =U for some x € N} 
={UEx*: NEU}, 


so @ is an open map. Also 
a{UEex™:NeUs={rEeX:a(x)e{UEX*™:NEU}} 


={xreX:reEN} 
—N, 
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so a is continuous. Thus a is the desired homeomorphism. 


Definition 4.7. Given two disjoint topological spaces, X,, X2, define the union of X1, X2 
to be the topological space whose underlying set is X;UX» and whose open sets are precisely 
the subsets of the form O; U O2 there O; is open in Xj. 

Given two topological spaces X,, Xo, let 


X1U Xo 
denote the topological space whose underlying set is 
{1} x X1 U {2} x Xe 
and whose open subsets are precisely the subsets of the form 
{1} x O, U {2} x O2 
where O; is open in X;, i= 1,2. X; U Xo is called the disjointed union of X1, Xo. 
The next result is used in the next section. 


Lemma 4.8. Given two Boolean algebras B, and Bz, the Boolean spaces (B, x Bz)* and 
Bj U BS are homeomorphic. 


ProoFr. The case that |B,| = |B2| = 1 is trivial, so we assume |B, x By| > 2. Given an 
ultrafilter U in (B, x B2)*, let 7;(U) be the image of U under the projection homomorphism 


T; : By x By — B,. 


Claim: U =7,(U) x By or U = B, x m2(U). 
To see this, note that 
(1,0) V (0,1) = (1,1) EU 
implies 
(1,0)€U or (0,1) EU. 
If (1,0) € U, then 
(bi, be) eU> (b1, 0) = (by, be) A (1, 0) EU; 


hence 

m™(U) x {O} CG U, 
0) 

m™(U) x Bo = U. 
As 


U Cc m™(U) x T72(U) 


84. Stone Duality 157 


we have 


U = 7(U) x Bg. 


Likewise we handle the case (0,1) € U. This finishes the proof of the claim. 
From the claim it is easy to verify that either 7,(U) or 72(U) is a filter, and then an 
ultrafilter. So let us define the map 


8: (Bi x Bg)* > Bi U BS 


by 

BU) = {i} x m(U) 
for 7 such that 7;(U) is an ultrafilter of B;. The map ( is easily seen to be injective in view 
of the claim. If U € By then U x By € (By, x Bz)*, so 


B(U x Bo) = {1} x U, 
and a similar argument for U € B% shows (3 is also surjective. Finally, we have 


B(Ne1,2)) = {8(U) : U € (By x Bo)", (61, b2) € U} 
= {G(U) :U € (Bi x Ba)*, (bi, 0) € U or (0, be) € US 
= {G(U) :U € (B, x By)*, by € m4 (U) or by € m(U)} 
= 41 Ng U2 Ne 


Actually Stone goes on to establish relationships between the following pairs: 


Boolean algebras <—> Boolean spaces 
filters «<— closed subsets 
ideals <— open subsets 
homomorphisms <— continuous maps. 


However, what we have done above suffices for our goals, so we leave the other relation- 
ships for the reader to establish in the exercises. 


REFERENCES 


1. P.R. Halmos [18] 
2. M.H. Stone [1937] 


EXERCISES §4 


1. Show that a finite topological space is a Boolean space iff it is discrete (i.e., every 
subset is open). 


158 


10. 


Ld. 


I: 


13. 
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If X is a Boolean space and J is any set, show that the Tychonoff product X% is a 
Boolean space; and if J is infinite and |X| > 1 then (X‘)* is an atomless Boolean 
algebra. 


Show that a countably infinite free Boolean algebra B has a Boolean space homeomor- 
phic to 2”, where 2 is the discrete space {0,1}; hence B is isomorphic to the Boolean 
algebra of closed and open subsets of the Cantor discontinuum. Conclude also that B 
has continuum many nonprincipal ultrafilters. 


Given any set J show that (Su(Z))* is the Stone-Cech compactification of the discrete 
space I. 


Give a topological description of the Boolean space of the algebra of finite and cofinite 
subsets of an infinite set J. 


For B a Boolean algebra and U € B***, show that there is an x € B* with (\U = {z}, 
andU={Ne B™: ace N}-. 


If B is a Boolean algebra and F is a filter of B, show that F* =(\{N,:b€ F} isa 
closed subset of B* and the map F’'+> F™ is an isomorphism from the lattice of filters 
of B to the lattice of closed subsets of B*, and 6 € F iff N, D F*. 


If B is a Boolean algebra and J is an ideal of B, show J* = J{N, : b € I} is an open 
subset of B* such that the map J + I* is an isomorphism from the lattice of ideals of 
B to the lattice of open subsets of B* with b € I iff Ny C I*. 


If a: By — Bg is a Boolean algebra homomorphism, let a* : BS — Bj be defined by 
a*(U) = a~!(U). Show a* is a continuous mapping from B3 to Bz which is injective if 
q@ is surjective, and surjective if a is injective. 


If a: X,; — Xp, is a continuous map between Boolean spaces, let a* : Xj — Xf be 
defined by a*(N) = a~!(N). Then show a* is a Boolean algebra homomorphism such 
that a* is injective if a is surjective, and surjective if @ is injective. 


Show that the atoms of a Boolean algebra B correspond to the isolated points of B* 
(a point x € B* is isolated if {x} is a clopen subset of B*). 


Given a chain (C,<) define the interval topology on C to be the topology generated 
by the open sets {c € C:c> a} and {c€ C: c< a}, for a € C. Show that this gives 
a Boolean space iff (C,<) is an algebraic lattice (see I§4 Ex. 4). 


If \ is an ordinal, show that the interval topology on A gives a Boolean space iff X is 
not a limit ordinal. 
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14. Given Boolean spaces Xj,...,X, such that X;9 X; = {x} for i 4 j, show that the 
space Y = Un<icn X; with open sets {Ul<icn U; : U; open in X;,x belongs to all or 
none of the U;’s} is again a Boolean space. 


85. Boolean Powers 


The Boolean power construction goes back at least to a paper of Arens and Kaplansky in 
1948, and it has parallels in earlier work of Gelfand. Arens and Kaplansky were concerned 
with rings, and in 1953 Foster generalized Boolean powers to arbitrary algebras. This con- 
struction provides a method for translating numerous fascinating properties of Boolean alge- 
bras into other varieties, and, as we shall see in 87, provides basic representation theorems. 


Definition 5.1. If B is a Boolean algebra and A an arbitrary algebra, let A[B]* be the set 
of continuous functions from B* to A, giving A the discrete topology. 


Lemma 5.2. If we are given A,B as in 5.1, A[B]* is a subuniverse of AX, where X = B*. 


PrRooF. Let c,...,¢n € A[B]*. As X is compact, each c; has a finite range, and, for 
a € A, c;'(a) is a clopen subset of X. Thus we can visualize a typical member of A[B]* as 
in Figure 28, namely a step function with finitely many steps, each step occurring over a 
clopen subset of X. If A is of type F and f € F,, then if we choose clopen subsets Ni,..., Nx 
which partition X such that each c; is constant on each Nj,i = 1,...,n,j = 1,...,k, it is 
clear that f(ci,...,¢n) is constant on each N;. Consequently, f(c1,...,¢,) € A[B]*. 


Figure 28 


Definition 5.3. Given A,B as in 5.1, let A[B]* denote the subalgebra of A*, X = B*, 
with universe A[B]*. A[B]* is called the (bounded) Boolean power of A by B. (Note that 
A[1]* is a trivial algebra.) 


Theorem 5.4. The following results hold for Boolean powers: 
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a) A[B]* is a subdirect power of A. 

b) A can be embedded in A|B]* if B is not trivial. 
c) A[2]* = A. 

d) A[B, x Bo|* a A[B,]* x A[B,]*. 

e) (Ay x A>)|B]* a A,|B]* x A.[B]*. 


PrRooF. For (a) and (b) note that the constant functions of A* are in A[B]*. (c) follows from 
noting that 2* is a one-element space, so the only functions in A* are constant functions. 

Let C(X, A) denote the set of continuous functions from X to A, for X a Boolean space, 
and let C(X,A) denote the subalgebra of A* with universe C(X,A). Given two disjoint 
Boolean spaces X1, X2 define 


a: C(x, U Xo, A) —= C(X1, A) x C(Xo, A) 
by 
ae= (ele cel se): 
As X,,X» are clopen in X,; U X2 it is not difficult to see that a@ is a bijection, and if 
C1,-++3€n € C(X, U Xo, A) and f is a fundamental operation of arity n, then 


(cr, fe Ga) ber baer a aie) x2) 


(cr ber +125 €n [Bais fla er +125 €n x2) 


OFC lh) = 


so @ is an isomorphism. As 


A[B]* = C(B*, A) 


it follows from 4.8 that 
A[B, x Bo|* = A[B,]* x A[B)]*. 


This proves (d). 
Next define 
a: A,([B]* x A,|[B]* = (Ay x A:)|B]* 
by 
a((c1, C2)) (a) = (cx, C2). 
Clearly this is a well-defined injective map. If c € (A; x A2)[B]* let Ni,..., N, be a partition 
of B* into clopen subsets such that c is constant on each N;. Then let 


ci(@) = (mc)(2), 
i= 1,2. Then c; € A;[B]* as c; is constant on each N;, and 


a((c1, C2) =¢, 
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so a is surjective. If (c),c}) € A,[B]* x Ay[B]*, 1 < j <n, and if f is a fundamental n-ary 
operation then 


af ((ct, €2),---, (ef, ¢3)) (x) = a((F (et, -.- ef), F(ea,---63)))(2) 
= (f(c1,.-.. ef) (2), f(4,---, 4) @) 
SOC ie Gese Of Cole eee) 
= f((cqa, cox), ..., (etx, ex) 
= f(a((ct, €2))(z),-.-,a((er, ¢))(2)) 
= f (act, ),---, act, &)) (x); 
hence 
af (ci, C3), +++ (ef, )) = F(a(cq, c2),---, (ct, €2)) 


This proves 


as a is an isomorphism. 


The next result is used in §7, and provides the springboard for the generalization of 
Boolean powers given in 88. 


Definition 5.5. If a,b € [],-, A; the equalizer of a and b is 


[a = d] = {6 ET: alt) = b@)}; 


and if J;,...,J, partition I and aj,...,@n € [],<, Ai then 


tel 


ai[z, U---Uanly, 


denotes the function a where 
a(7) = a(t) if 2€ Jp. 


Theorem 5.6. Let B be a Boolean algebra and A any algebra. With X = B*, a subset S 
of A* is A[B]* iff S satisfies 

(a) the constant functions of A* are in S, 

(b) for c1,¢2 € S, [c, = ca] ts a clopen subset of X, and 

(c) for c1,c2 € S and N a clopen subset of X, 


etn U co[x_ne€ S. 


PROOF. (=) We have already noted that the constant functions are in A[B]*. For part (b) 
note that c € A[B]* implies c~'(a) is clopen for a € A as c is continuous. Also as c has finite 
range, 

[c1 = co] Sees cy (a) Ncy*(a) 


acA 
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is a clopen subset of X. Finally 
c= c/n U colx-n 
is in A[B]* as 
oa) = (e¢'(a) AN) U (e5*(a) 1 (X —N)), 
a clopen subset of X for a € A. 
(<=) Fora € A let cy € A* be the constant function with value a. From (b) we have, for 


ce 8S, 
E"@) = [c = Cals 


a clopen subset of X; hence c is continuous, so c € A[B]*. Finally, if c € A[B]* let 


for a € A. Then 


CS U Cal Nas 


acA 


so by (c), cE S. 
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EXERCISES 85 


1. Given Boolean algebras B,, Bz, define B; * By to be (Bj x B3)*. Show that for any 
A, (A[B,]*)[B2)* = A[B; * B.]*; hence (A[B,]*)[B2)* = (A[B»]*)[Bi]*. 


2. If F is a filter of B, define 07 on A[B]* by (a,b) € Op iff [a = b] D F* (see §4 Ex. 7). 
Show that A[B]*/@- = A[B/F)*. 


3. Show that |A[B]*| = |A]|- |B] if either |A] or |B| is infinite, and the other is nontrivial. 


4. (Bergman). Let M be a module. Given two countably infinite Boolean algebras B,, Bz 
show that M[B,|* = M[B,]*. (Hint: (Lawrence) Let Q; be an ordered basis (see §2 
Ex. 7) for B;, i = 1,2, and let a: Qi — Q2 be a bijection. For a € M and q € Qi, let 
Cu!_ denote the member of M[B;]* with 


a ifxeN, 


Ce er 
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Then each member of M[B,]* can be uniquely written in the form Co, [g, ++--+Can lan: 
where qi <---< Qn, q; € Q;. Define @ : M[B,|* — M[B,]* by 


Cay lon Sports Gas Pe Cay ern sa a lon, 


where q < +++ < qn. Then ( is the desired isomorphism. ) 


Show that we can replace M by any algebra A which is polynomially equivalent to a 
module. 


86. Ultraproducts and Congruence-distributive 
Varieties 

One of the most popular constructions, first introduced by Los (pronounced “wash”) in 

1955, is the ultraproduct. We will make good use of it in both this and the next chapter. 


The main result in this section is a new description due to Jonsson, using ultraproducts, of 
congruence-distributive varieties generated by a class K. 


Definition 6.1. For any set J, members of Su(J)* are called ultrafilters over I. Let A;, i € I, 
be a family of algebras of a given type, and let U be an ultrafilter over J. Define @y on [],<; Ai 
by 

(a,b) €6y iff [a=d] EU, 


where [a = 0] is as defined in 5.5. 
Lemma 6.2. With A;, 1€ I, and U as above, Oy is a congruence on They A;. 


PROOF. Obviously, 6y is reflexive and symmetric. If 


(a,b) € @y and (b,c) € Oy 


then 
[a =c] D [a =d] N[b=c] 
implies 
[a =c] €U, 
so 
(a,c) € Oy. 
li 


(a1, bi), aes Os oe E Oy 


and f is a fundamental n-ary operation then 


LF CGA 25-Gin) =F Pigaena yl ») [a1 =i] N---A Jan = by] 
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implies 


(I (Giksd Gals Ff OipacasOn)) € Oy. 


Thus 6y is a congruence. 


Definition 6.3. With A;, 2 € J, and U an ultrafilter over I, we define the ultraproduct 


][4/v 


tel 


[[ A:/v. 


wel 


The elements of [[,-; Ai/U are denoted by a/U, where a € [J,<; Ai. 


to be 


Lemma 6.4. For a/U, b/U in an ultraproduct [],-; Ai/U, we have 


wel 


a/U=b/U iff [a=b] €U. 


PROOF. This is an immediate consequence of the definition. 


Lemma 6.5. If {A; : 7 € I} is a finite set of finite algebras, say {B,,...,B,}, (J can be 
infinite), and U is an ultrafilter over I, then [],-, A;/U is isomorphic to one of the algebras 
B,,..., By, namely to that B; such that 


we. 


{iel:A;=B}eU. 


PROOF. Let 
S,=4¢61 2A, = B,) 
Then 
I=S$,U-:-US), 


implies (by 3.13) that for some J, 
Oe Us 
Let B; = {bi,...,b¢}, where the b’s are all distinct, and choose ay,...,a, € [],<, Ai such 
that 
ay(7) = bi, an , ax (2) = by 
Aj; 


if i € S;. Then if we are given a € [],<; 


[a =aJU---Ula=a;] 2 S;, 
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so 
[a=a] €U or... or [a= a] € UV; 
hence 
Of Saif Of ass, or a7 = aE: 
Also it should be evident that a1/U,...,a,/U are all distinct. Thus [[,.; A;/U has exactly 
k elements, a,/U,...,a,/U. Now for f a fundamental n-ary operation and for 
1 igs, OR Op os PC LOT a5 Det 
with 
f (bi, rea) bi, ) = er 
we have 
[i (aes oot Gig) = ins] = ore 
hence 


Tl Gad, x 30j (OC) Se U- 
Thus the map 
Qa: [[4/v — B; 
i€l 
defined by 
a(a;/U) = bi, 


1<t<k, is an isomorphism. 


Lemma 6.6 (Jonsson). Let W be a family of subsets of I(A @) such that 
Gi Dew, 
(ii) of JEW and J CK CI then K € W, and 
(iii) if Ji UJg © W then J, © W or Jo © W. Then there is an ultrafilter U over I with 
UCW. 


ProoFr. If @ € W then W = Su(J), so any ultrafilter will do. If @ ¢ W, then Su(J) — W is 
a proper ideal; extend it to a maximal ideal and take the complementary ultrafilter. 


Definition 6.7. We denote the class of ultraproducts of members of K by Py(I). 


Theorem 6.8 (Jonsson). Let V(K‘) be a congruence-distributive variety. If A is a subdirectly 
irreducible algebra in V(K), then 
A € HSPy(k); 
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hence 
V(K) =IPsHSPy(K). 


PROOF. Suppose A is a nontrivial subdirectly irreducible algebra in V(A’). Then for some 
choice of A; € K, i € J, and for some B < [],_, A; there is a surjective homomorphism 


i€l 
a:B-A, 
as V(K) = HSP(K). Let 
6? = kera. 


For J C I let 


2 
6; = ¢ (a,b) € (1) :-JCla=)] 
il 
One easily verifies that 0; is a congruence on [[,.,; Ai. Let 
Ostp =O;N B? 
be the restriction of 6; to B, and define W to be 


Clearly 
Iew, SEW 
and if 
JEW and JCKCI 
then 
Oxle C9, 

as 

Oxle COslB. 
Now suppose 

J, U Jn € W, 
164 

9 j,UJ, lB CS @. 
As 


Oj,UIg = O7,N G5, 


it follows that 
(9.,uJ2) lB = 07, lB N 97, IB : 
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Since 


0=6V (87,18 A 83 1B) 


it follows that 
0 = (OV 87, [B) (8 V 3, tz) 


by distributivity, and as Theorem IT§6.20 gives 


Con B/@ = [6, V] 
< Con/B 


we must have from the fact that B/@ is subdirectly irreducible (it is isomorphic to A) 
6=0V6;1p 
for 2 = 1 or 2; hence 
05, Ip Cc 6 


for 1 = 1 or 2, so either J, or Jy is in W. By 6.6, there is an ultrafilter U contained in W. 
From the definition of W we have 
bute Cé 


as 
Oy =| {9s : J € US. 


Let v be the natural homomorphism from [[,., A; to [[,-; Ai/U. Then let 


il 
B:B—v(B) 
be the restriction of vy to B. As 
ker 6 = Oy |p 
Cé 
we have 
A=>B/0 
= (B/ker 3)/(0/ ker 3). 
Now 
B/ker 8 = v(B) < [] Ai/U 
il 
6) 
B/ker GB € ISPy(k); 
hence 


A € HSP,(K). 
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As every algebra in V(J) is isomorphic to a subdirect product of subdirectly irreducible 
algebras, we have 
V(K) =IPsHSPy(K). 


One part of the previous proof has found so many applications that we isolate it in the 
following. 


Corollary 6.9 (Jénsson’s Lemma). If V is a congruence-distributive variety and A; € V, i € 
I, if B < [],-_, Ai, and 6 € Con B is such that B/6 is a nontrivial subdirectly irreducible 
algebra, then there is an ultrafilter U over I such that 


bulep Ceé 


where Oy is the congruence on [],-, A; defined by 


wel 


(a,b)€ Oy iff fa=d] EU. 


Corollary 6.10 (Jénsson). If K is a finite set of finite algebras and V(K) is congruence- 
distributive, then the subdirectly irreducible algebras of V(K) are in 


HS(k), 


and 
V(4K) = TPs(AS(4)). 


ProoFr. By 6.5, Pu(K) C I(K), so just apply 6.8. 


REFERENCES 
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EXERCISES 86 


1. An ultrafilter U over a set I is free iff (}U = @. Show that an ultrafilter U over J is 
free iff J is infinite and the cofinite subsets of J belong to U. 


2. An ultrafilter U over I is principal if (\U #4 @. Show that an ultrafilter U is principal 
if U ={J CI:i€ J} for someie I. 
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87. 


If U is a principal ultrafilter over J and A;, 7 € J, is a collection of algebras, show that 
Ile, Ai/U = A; where ()U = {j}. 


Show that a finitely generated congruence distributive-variety has only finitely many 
subvarieties. Show that the variety generated by the lattice Ns has exactly three 
subvarieties. 


(Jonsson) If A;, Ag are two finite subdirectly irreducible algebras in a congruence- 
distributive variety and A; 4 Ao, show that there is an identity p % q satisfied by one 
and not the other. 


Given an uncountable set J show that there is an ultrafilter U over J such that all 
members of U are uncountable. 


Show that for J countably infinite there is a subset S of the set of functions from 
I to 2 which has cardinality equal to that of the continuum such that for f # g 
with f,g € S, {i € I: f(i) = g(t)} is finite. Conclude that |A’/U| > 2% if U is a 
nonprincipal ultrafilter over J and |A| is infinite. 


Primal Algebras 


When Rosenbloom presented his study of the variety of n-valued Post algebras in 1942, he 
proved that all finite members were isomorphic to direct powers of P,,, (see II§1), just as in the 
case of Boolean algebras. However, he thought that an analysis of the infinite members would 
prove to be far more complex than the corresponding study of infinite Boolean algebras. Then 
in 1953 Foster proved that every n-valued Post algebra was just a Boolean power of P,,. 


Definition 7.1. If A is an algebra and 


f: A" =A 


is an n-ary function on A, then f is representable by a term if there is a term p such that 


F@isss- (GQ) HO" (Gis 2) 


for aj,...,@n € A. 


Definition 7.2. A finite algebra A is primal if every n-ary function on A, for every n > 1, 
is representable by a term. 


In 810 we will give an easy test for primality, and show that the Post algebras P,, are 
primal. However, one can give a direct proof. A key tool here and in later sections is the 
switching function. 


170 IV Starting from Boolean Algebras ... 


Definition 7.3. The function 
s:AtsA 
on a set A defined by 
c ifa=b 


PDC . ifaZb 


is called the switching function on A. A term s(x, y, u,v) representing the switching function 
on an algebra A is called a switching term for A. 


Theorem 7.4 (Foster). Let P be a primal algebra. Then 


V(P) = /{P|B]* : B is a Boolean algebra}. 


PROOF. We only need to consider nontrivial P. If EF is an equivalence relation on P and 


(a,b) ¢ E, 
(c,d) EE 


with c 4 d, then choose a term p(x) such that 


plc) =a, 

p(d) =b 
Thus 

E ¢ Con(P); 


hence P is simple. Also the only subalgebra of P is itself (as P is the only subset of P closed 
under all functions on P). As P has a majority term, it follows that V(P) is congruence- 
distributive, so by 6.8 and the above remarks 
V(P) =IPsHSPy(P) 
= IPs(P) U {trivial algebras}. 
Thus we only need to show every subdirect power of P is isomorphic to a Boolean power of 


P. Let 
A<P! 


be a nontrivial subdirect. power of P. Recall that for p,,p2 € P! we let 
[pi = po] = {1 € F: pit) = po(2)}. 


In the following we will let s(x, y, u,v) be a term which represents the switching function on 
P. 
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Claim i. The constant functions of P! are in A. 
This follows from noting that every constant function on P is represented by a term. 


Claim ii. The subsets [a, = ag], for a1,a2 € A, of I form a subuniverse of the Boolean 
algebra Su(I). 


Let c,,c2 be two elements of A with 
[e1 = Ca] = 


(such must exist as we have assumed P is nontrivial). Then for ay, a2, b;, bp € A the following 
observations suffice: 


c, = c] 


Ss a1, a2, b1, bz) = = by] 


ler 
[a1 = ay] U ae 
[a1 7 pa [1 = mt bo] at [s(a, a2, bi, a1) = s(a1, a2, bo, az)| 

— [a, = ag] = [s(a1, a2, C1, C2) = Co]. 
Let B be the subalgebra of Su(/) with the universe 
{]a1 = a9] : a1, a2 € A}, 
and let 
XS Be. 
Claim iii. Fora € A andU € X there is exactly one p € P such that a~'(p) € U. 


Since P is finite this is the an easy consequence of the facts 


U is an ultrafilter, and the a~'(p)’s are pairwise disjoint. 
So let us define 0 : A x X — P by 


o(a,U)=p iff a‘(p) €U. 
Then let us define a: A > P* by 
(aa)(U) = o(a,U). 


Clearly all the constant functions of P* are in vA (just look at the images of the constant 
functions in A). 
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Claim iv. For a,b € A, [aa = ab] = {U € X: [a = b] € VU}. 
To see this we have 


[aa = ab] = {U € X : (aa)(U) = (ab)(U)} 
={UEX:o(a,U) =a(b,U)} 
={UEX:a'(p) €U,b"'(p) EU for some p € P} 
={UExX:|[a=)] €U} (why’?). 


Thus a typical clopen subset of X is of the form [aa = ab]. 
Next for a},a2 € A and N a clopen subset of X, choose 6;,b2 € A with 


N={UEX: [bi = bo] € VU}. 


Let 
a =a) U a2/1-u 
where 
M = [bi = by] 
Then 
acA 
as 
a= (by, bo, a1, a2). 
Now 
[aa =aaq]={U EX: la=a] € VU} 
D{UEX:MeEU} 
= N, 
and 
[aa = aa] = {U € X: [a = a] € UV} 
2 {UEX:I-MeEU} 
=X —N; 
hence 


aa =aa,;y U aag/x_n . 


Then by 5.6 we see that 
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The map a is actually a bijection, for if a;,a2 € A with 
a, F a2 
then choosing, by 3.15(b), U € X with 
[a1 = ay] ¢ U, 


we have 
(aai)(U) # (aaz)(U). 


Finally, to see that a is an isomorphism, let a,,...,a, € A, and suppose f is an n-ary 
function symbol. Then for U € X and p such that 


Cf Gij2tie oO) =p 


we can use 
Fla1,.-.4n) "= [J af’) n---MaZ\(pn) 
piEP 
f(p15.-5Pn)=p 
and 


HG uxt (Deu 
to show that, for some choice of p1,...,Pn with f(pi,...,Pn) =p, 


ay '(p1) N+ a;"(Pn) € U. 


Hence 
a (p)eu, Iircn, 
and thus 
O(a) = pi, 1 ee ee 7) 
Consequently, 
a(f(ai,.--,@n))(U) = o(f(ai,..-, an), U) 
=p 
. TPigea Da) 
= Fear, 0 )ss 4230 (Ons )) 
= f((aa1)(U),..-, (@an)(U)) 
=F (Weiss coa NU). 
sO 
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EXERCISES 87 
1. Show that a primal lattice is trivial. 
2. Show that if B is a primal Boolean algebra, then |B| < 2. 
3. Prove that for p a prime number, (Z/(p),+,-,—,0,1) is a primal algebra. 
4. Prove that the Post algebras P,, are primal. 


5. If Bi, Bz are Boolean algebras and a : By — Bg is a homomorphism, let @ : B}* — 
B3* be the corresponding homomorphism defined by @(Ny) = Nowy. Then, given any 
algebra A, define a* : A[B,|* > A[Bo]* by 


[a*c=c,] =alc=c], forae A. 
Show that a* is a homomorphism from A[B,]* to A[Bg]*. 


6. If P is a primal algebra, show that the only homomorphisms from P[B,|* to P[B»|* 
are of the form a* described in Exercise 5. 


7. If P is a nontrivial primal algebra, show that P[B,|* ~ P[B.]* iff B; = Bo. 


8. (Sierpitiski). Show that any finitary operation on a finite set A is expressible as a 
composition of binary operations. 


88. Boolean Products 


Boolean products provide an effective generalization of the notion of Boolean power. Actually 
the construction that we call “Boolean product” has been known for several years as “the 
algebras of global sections of sheaves of algebras over Boolean spaces” ; however, the definition 
of the latter was unnecessarily involved. 


Definition 8.1. A Boolean product of an indexed family (A,)ccex, X #4 @, of algebras is a 
subdirect product A < |]... Az, where X can be endowed with a Boolean space topology 
so that 


(i) [a = 0] is clopen for a,b € A, and 
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(ii) if a,b € A and N is a clopen subset of X, then 


aln U bl x_n EA. 


We refer to condition (i) as “equalizers are clopen”, and to condition (ii) as “the patchwork 
property” (draw a picture!). For a class of algebras K, let [%(/¢) denote the class of Boolean 
products which can be formed from nonempty subsets of kK. Thus [¢(iv) C Ps(k). 


Our definition of Boolean product is indeed very close to the description of Boolean 
powers given in 5.6. In this section we will develop a technique for establishing the existence 
of Boolean product representations, and apply it to biregular rings. But first we need to 
develop some lattice-theoretic notions and results. 


Definition 8.2. Let L be a lattice. An ideal I of L is a nonempty subset of LZ such that 


(jij)ael, be L,andb<asSbdel, 
(ii)a,jbeTSavoel. 


I is proper if I 4 L, and I is maximalif I is maximal among the proper ideals of L. Similarly 
we define filters, proper filters, and maximal filters of L. 


Parallel to 3.7, 3.8, and 3.9 we have (using the same proofs) the following. 


Lemma 8.3. The set of ideals and the set of filters of a lattice are closed under finite 
intersection, and arbitrary intersection provided the intersection is not empty. 


Definition 8.4. Given a lattice L and a nonempty set X C L, let [(X) denote the least 
ideal of L containing X, called the ideal generated by X, and let F'(X) denote the least filter 
of L containing X, called the filter generated by X. 

Lemma 8.5. For a lattice L and X C L we have 


(XX) ={ae€L:a<aV---+V apy for some ay,...,an € X} 
F(X) ={ae€L:a>ayA---Aady for some ay,...,an € X}. 


In particular if J is an ideal of L and b € L, then 


I(JU {b}) = {ae L:a<jVb for some j € J}. 


Definition 8.6. A lattice L is said to be relatively complemented if for 


a<xb<e 
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in L there exists d € L with 


020, 
bVd=c. 


d is called a relative complement of b in the interval [a, c]. 


Lemma 8.7. Suppose L is a relatively complemented distributive lattice with I an ideal of 
L anda € L—TI. Then there 1s a maximal ideal M of L with 


ICM, aé€M. 


Furthermore, L — M is a maximal filter of L. The same results hold interchanging the words 
ideal and filter. 


PROOF. Use Zorn’s lemma to extend J to an ideal MW which is maximal among the ideals of 
L containing J, but to which a does not belong. It only remains to show that M is actually 
a maximal ideal of L. For b,,b2 ¢ M we have 


acél(MU{b}), i=1,2: 
hence for some c; € M, i= 1,2, 


a<bVc, 
a< bo Vc. 


Hence 


a < (by V C1) /\ (by V C2) 
= (b; A bz) V [(b1 A C2) V (er A b2) V (C1 A €2)). 


As the element in brackets is in IV, we must have 
bj) Abo € M 
as a ¢ M. Thus it is easily seen that L — M is a filter. Now given b,,b2 ¢ M, choose c € M 


with 
Cc < by : 


Then let d; € L be such that 


b; V dy = b; V by, 
bi Ad, =c, 
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ie., dy is a relative complement of b; in the interval [c,b; V bg]. As L — M is a filter and 
c¢éL—M, it follows that d, € M. But then 

by < by V dy 


says 
by € I(M U {bi}); 


hence 
L=1(M U {by}). 


Consequently M is a maximal ideal. 


Lemma 8.8. Jn a distributive lattice relative complements are unique if they exist. 
PROOF. Suppose L is a distributive lattice and 
a<xb<e 


in L. If d; and dz are relative complements of b in the interval [a,c], then 


dy = dy AC 
= (d, Ab) V (di A da) 
=> dy /\ dy. 
Likewise 
dz = dy A do, 
sO 
dy = dy 


Definition 8.9. If L is a relatively complemented distributive lattice with a least element 
0 and a,b € L, then a\b denotes the relative complement of b in the interval [0,a V 0]. 


Lemma 8.10. /f L is a distributive lattice with a least element 0 such that for a,b € L the 
relative complement (denoted a\b) of b in the interval |0,a V b| exists, then L is relatively 
complemented. 


PROOF. Let 


hold in L. Let 
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Then 
bVd=bV (E\b) =<, 


and 
bAd=bA [aV (c\b)| 
=aV [bA (c\b)| 


= a, 


so d is a relative complement of 6 in [a,c]. 


Now we have all the facts we need about relatively complemented distributive lattices, 
so let us apply them to the study of Boolean products. 


Definition 8.11. If A is an algebra, then an embedding 
a:A-> I] A, 
rex 


gives a Boolean product representation of A if a(A) is a Boolean product of the A,’s. 
Theorem 8.12. Let A be an algebra. Suppose L is a sublattice of Con A such that 


(i 
‘ 


(iii 


Ae L, 

the congruences in L permute, 

L is a relatively complemented distributive lattice, and 

for eacha,b € A there is a smallest member Oa, of L with (a,b) € Aap. 


ea Ra Se 


(iv 
Let 
X ={M: M is a maximal ideal of L} U {L}, 
and introduce a topology on X with a subbasis 
{Ngp:9€ L}U{Do:6€ L} 
where 
No={MeExX:0€ M}, 


and 
Dp={MEX:0¢ M}. 


Then X is a Boolean space, |) M is a congruence for each M € X, and the map 


a:A> ][(4/U) 


Mex 
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defined by 
(aa)(M) = a/| JM 


gives a Boolean product representation of A such that 
[aa = ab] = No,,. 


Consequently, 
Ae IT({A/| JM: M € X}). 


PROOF. 
Claim i. The subbasis 
{No :6€L}U{Do:6€ L} 
is a field of subsets of X, hence a basis for the topology. In particular, 
(a) xX = Na, iG) = Dy, 
and for 0,¢€ L, 
(b) No U Nog = None; 


h) X =NoUDs, @=NoM Do. 


PROOF. (a) Clearly 
X= Nx @ = Dy. 
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The proofs below make frequent use of the fact that L — M is a filter of L if M € X — {L}. 


(b) MEN,UNgiff0EM or GEM 
if@Ng@EM 
iff M € Nong. 
One handles (c) similarly. 
(4d) Me DegUDsifO0¢M or 6¢M 
iffOVeE_M 
iff M € Dove. 


One handles (e) similarly. 
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(f) From the statements 


oN (A\d) =A 
A\¢ CO 


9COVdG=6V (A\9) 
it follows, for M € X, that 


@EM or A\GEM 
06¢M or A\¢EM 
@~M or AGEM or OEM. 


The first two give 
0\¢¢M>0¢ManddgeM 


and from the third 
)\¢EM>O0EMorgeM. 


Thus 
I\¢EMSO0EMorgeM. 


(g) This is an immediate consequence of (f). 
(h) (These assertions are obvious.) 
Thus we have a field of subsets of X. 


Claim ii. X is a Boolean space. 


ProoFr. If M,,Mz € X and M, 4 Mo, then without loss of generality let 6 € My, — Mp. 
Then 


ME No, 
My € Dao, 


so X is Hausdorff. From claim (i) we have a basis of clopen subsets. So we only need to 


show X is compact. Suppose 
X =|JMo,U LJ Dy,. 
iel jeJ 
As Le€ X it follows that I ~ @, say ig € I. Let 
Dy = No, Do,, 


and 
Dy = Dg, Do,,,- 
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Then 


Do,, = X 1 De,, 


=|JDe lL) De. 


iel jeJ 
If the ideal of L generated by 
{0,:0€ T}U{o;: 9 € J} 


does not contain 6;,, then it can be extended to a maximal ideal M of L such that 6;,, ¢ M. 


But then 
M € Do, — (U De Ds) . 


ier jes 
which is impossible. Thus by 8.5 for some finite subsets Jp (of J) and Jo (of J) we have 
b= \/ OV Ml oe 
1€Io jEJo 


hence, by claim (i), 
Do, © LU Da, ¥ LJ De. 


1EIo jEJo 
As 

Do, © No, 

Dg c Deg, 
we have 

X= Noi, U D6, 
= No, UJ No U LL) De,, 
i€Io jEJo 


so X is compact. 


Claim iii. @ gives a Boolean product representation of A. 
PROOF. Certainly a is a homomorphism. If a 4 6 in A, then 
{9 EL: (a,b) € 6} 
is a proper filter of L. Extend this to a maximal filter F’ of L, and let 
M=L-F, 
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a maximal ideal of L. Thus 


(a,b) JM 
as 
(a,b) € 0 
for 6 € M. From this follows 
()(J™) =A, 
Mex 


so aA is a subdirect product of the A/\J™M by II88.2. 
For a,b € A we have 
[aa = ab] = {M € X; (a,b) € J} 
={MEX:6%¢€ M} 


ab? 


so equalizers are clopen. 
Next given a,b € A and @ € L we want to show 


(aa) [n, U (ab) lx_-n, € aA. 


Choose ¢ € LE such that 
(a,b) € 6. 
Then 
(a,b) €OVd=OV (A\9), 


so by the permutability of members of L there is a c € A with 


(a,c) € 9, 
(c, b) € f\é. 


and 
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we have 
ac = aaly, U abl p,, 


so @A has the patchwork property. 


Definition 8.13. Given A let 
Spec A= {¢€ Con A: ¢ is a maximal congruence on A} U {V}, 
and let the topology on Spec A be generated by 
{E(a, b)|a,b € A} U {D(a, b)|a, b € A}, 
where 


E(a,b) ={¢€ Spec A: (a,b) € ¢}, 
D(a,b) ={¢ € Spec A: (a,b) ¢ o}. 


Corollary 8.14. Let A be an algebra such that the finitely generated congruences permute 
and form a sublattice L of Con A which is distributive and relatively complemented. Then 
the natural map 


p:A—> [I A/e 


Oe SpecA 


gives a Boolean product representation of A, and for a,b € A, 


[Ga = Bb] = E(a, b). 


Proor. Let M e€ X, X as defined in 8.12. If 


M=L 
then 
(Ju = Ve Spec A. 
If 
i ea Oe 
then for some a,b € A, 
O(a, b) gM, 


sO 


(a, b) ¢|J™. 
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If L) M is not maximal then, for some 6 € Con A, 


LUuco¢v 
and 
| Ju #0. 
But 
6=|J{oeL: dC 5}, 
SO 


I={oeL:¢C9} 


is a proper ideal of LZ such that MC J but M # J. This contradicts the maximality of M. 
Hence M € X implies 


(Ju € Spec A. 
If M,, Mz € X with 
My, 7 Ma, 
then it is readily verifiable that 
JM 4M. 
And for 6 € Spec A, clearly 
{poe L:¢ CO} 


is in X. Thus the map 
a:X — SpecA 


defined by 
oM=|)M 


is a bijection. For a,b € A note that 
o(Nevasy) = o{M EX: O(a, b) E M} 


={(JM:MeX, (a,b) ¢| JM} 
= {0 € Spec A: (a,b) € 6} 
= E(a,b); 


hence ¢ is a homeomorphism from X to Spec A. Thus 


B:A> || A/a 


0€ Spec A 
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gives a Boolean product representation of A where 


[Ga = Bb] = E(a, b). 


EXAMPLE (Dauns and Hofmann). A ring R is biregular if every principal ideal is generated 
by a central idempotent (we only consider two-sided ideals). For r € R let I(r) denote the 
ideal of R generated by r. If a and 6 are central idempotents of R, it is a simple exercise to 
verify 
I(a) V I(b) = I(a + b — ab) 
and 
I(a) A I(b) = I(ab). 


Thus, for R biregular, all finitely generated ideals are principal, and they form a sublattice of 
the lattice of all ideals of R. From the above equalities one can readily check the distributive 
laws, and finally 


I(b)\I(a) = I(b — ab), 


i.e., the finitely generated ideals of R form a relatively complemented distributive sublattice 
of the lattice of ideals of R; and of course all rings have permutable congruences. Thus by 
8.14, R is isomorphic to a Boolean product of simple rings and a trivial ring. (A lemma of 
Arens and Kaplansky shows that the simple rings have a unit element.) 
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EXERCISES 88 


1. If L is a distributive lattice, J is an ideal of L, and a € L — J, show that there is an 
ideal J which contains J but a ¢ J, and L — J is a filter of L. However, show that J 
cannot be assumed to be a maximal ideal of L. 


2. (Birkhoff). Show that if L is a subdirectly irreducible distributive lattice, then |Z| < 2. 
3. Verify the details of the example (due to Dauns and Hofmann) at the end of 88. 


4. Let A be an algebra with subalgebra Ap. Given a Boolean algebra B and a closed 
subset Y of B*, let 
C = {c € A[B]*: c(Y) C Ao}. 


Show that C is a subuniverse of A|B]*, and C € '*({A, Ao}). 
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5. If A is a Boolean product of (A,)xex and Y is a subset of X, let Aly = {aly: a € A}, 
a subuniverse of [[,,-y Aa. Let the corresponding subalgebra be A fy . If N is a clopen 
subset of X, @ 4 N 4 X, show 


A =ATy X Afx_n. 


Hence conclude that if a variety V can be expressed as V = JT%(K), then all the 
directly indecomposable members of V are in I(kx). 


89. Discriminator Varieties 


In this section we look at the most successful generalization of Boolean algebras to date, 
successful because we obtain Boolean product representations (which can be used to provide 
a deep insight into algebraic and logical properties). 


Definition 9.1. The discriminator function on a set A is the function ¢t : A? — A defined 
by 


A ternary term t(x,y, z) representing the discriminator function on A is called a discrimi- 
nator term for A. 


Lemma 9.2. (a) An algebra A has a discriminator term iff it has a switching term (see §7). 
(b) An algebra A with a discriminator term is simple. 


PROOF. (a) (=) Ift(z, y, z) is a discriminator term for A, let s(x, y,u,v) = t(t(z, y, u), t(a, y, v), v). 
(<=) If s(x, y, u,v) is a switching term for A, then let t(z, y, z) = s(x, y, z,2). 
(b) Let s(x, y,u,v) be a switching term for A. If a,b,c,d € A with a 4 b, we have 
(c,d) = (s(a, a,c, d), (a, b,c, d)) € O(a, b); 


hence 


a#b=> O(a,b) =V. 


Thus A is simple. 


Definition 9.3. Let K be a class of algebras with a common discriminator term t(z, y, z). 
Then V(x) is called a discriminator variety. 


EXAMPLES. (1) If P is a primal algebra, then V(P) is a discriminator variety. 
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(2) The cylindric algebras of dimension n form a discriminator variety. To see this let 
C(x) = co(ci(..- (Cn_i(z)...). From §3 Exercise 7 we know that a cylindric algebra A of 
dimension n is subdirectly irreducible iff for a € A, 


atl cla), 
Thus the term t(z, y, z) given by 
[e(x + y) Aa] V [e(a + y)' A Z| 


is a discriminator term on the subdirectly irreducible members. This ensures that the variety 
is a discriminator variety. 


Theorem 9.4 (Bulman-Fleming, Keimel, Werner). Let t(x,y,z) be a discriminator term 
for all algebras in K. Then 


(a) V(K) is an arithmetical variety. 

(b) The indecomposable members of V(K) are simple algebras, and 

(c) The simple algebras are precisely the members of 1S Py(K+), where K+ is K, augmented 
by a trivial algebra. 

(d) Furthermore, every member of V(K) is isomorphic to a Boolean product of simple alge- 
bras, 1.€., 


V(K) = ITS Py(K,). 


Proor. As t(z,y,z) is a 2/3-minority term for K, we have an arithmetical variety by 
11§12.5. Hence the subdirectly irreducible members of V(/‘) are in HS'Py(K) by 6.8. For 


Ape K, vel, Ue Sud@)% and a,b,c € [],., 43,18 
aU =bU: 
then 
t(a/U, b/U,c/U) = t(a, b,c) /U 
=Er 
as 
[t(a, b,c) =c] EU 
since 
[a=b] €U 
and 
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Likewise, 
[a=] gU 
=>I-|[a=b] ¢U 
=> [t(a, b,c) =a] € U; 
hence 


GjU Abs Stal, bf U3c/U) = aj: 


thus ¢ is a discriminator term for |],-; Ai/U. If now 


ier 
B<|[Ai/U 


tel 


then ¢ is also a discriminator term for B. Consequently, all members of SPy(/) are simple 
by 9.2. It follows by 6.8 that the subdirectly irreducible members of V(k‘) are up to iso- 
morphism precisely the members of SPy(A+), and all subdirectly irreducible algebras are 
simple algebras with t(x, y, z) as a discriminator term. 

To see that we have Boolean product representations let 


A € PsSPy(K,), 


say A < [],-,Si,8; € SPy(Ky). Let s(x, y,u,v) be a switching term for SPy(K,) (which 
must exist by 9.2). If a,b,c,d € A and 


[a=] C [e=4] 
then 
(c,d) = (s(a,a,c,d), s(a,b,c,d)) € O(a, b). 
Thus 
(a, b) € {(c,d) : [a = b] C [c = d]} C O(a, d). 
The set 


is readily seen to be a congruence on A; hence 
O(a, b) = {(c, d) : [a = d] C [c = d]}. 


From this it follows that 


§9. Discriminator Varieties 


Let us verify these two equalities. For 2 € J, 


t(a, b,c)(4) = t(b, a, d) (2) 


holds iff 
a(i) = b(¢) and c(t) = d(a); 

hence 

ee, be) = 1 bacd)| = [a =d) rifle=<d], 

(a, b), (c,d) € O(t(a, b,c), t(b, a, d)), 

thus 

O(a, b), O(c, d) C O(t(a, b, c), t(b, a, d)). 
This gives 


O(a, b) V O(c, d) C O(t(a, b,c), t(b, a, d)). 


Now clearly 
(t(a, b, c), t(b, a, d)) € O(a, b) V O(c, d) 


t(a, b, c)O(a, b)t(a, a, c)O(c, d)t(a, a, d)O(a, b)t(b, a, d). 
Thus 
(t(a, b,c), t(b, a, d)) € O(a, b) V O(c, d), 


O(a, b) V O(c, d) = O(t(a, b,c), t(b, a, d)). 
Next, note that 
s(a,b,c,d)(i) = c(t) iff a(t) = b(2) or c(t) = d(i); 
hence 
[s(a, b, c,d) =c] = [a = b] U[c = d]. 
This immediately gives 


O(s(a, b,c, d),c) C O(a, b), O(c, d), 


O(s(a, b,c, d),c) C O(a, b) N O(c, d). 


Conversely, if 
(615 €2) € O(a, b) M O(c, d) 


then 
[a = OJ, [c = d] € [er = ea], 
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so 
[s(a, 6, c,d) = c] = [a = b] U[c =d] 
C le1 = ea], 
thus 
(e1, €2) € O(s(a, b, c,d), c). 
This shows 


O(a, b) N O(c, d) = O(s(a, b,c, d), c). 


The above equalities show that the finitely generated congruences on A form a sublattice 
L of Con A, and indeed they are all principal. As V(i) is arithmetical L is a distributive 
lattice of permuting congruences. Next we show the existence of relative complements. For 
a,b,c,d € A note that 


O(c, d) /\ O(s(c, d, a, b), b) = O(s(c, d, s(c, d, a, b), b), s(c, d, a, b)) 
=X 


as one can easily verify 


and 


O(c, d) V O(s(c, d, a, b), b) = O(t(c, d, s(c, d, a, b)), t(d, c, b)) 
= O(t(c, d, a), t(d, c, b)) 


(just verify that both of the corresponding equalizers are equal to [c = d] M [a = 6]); hence 
= O(a, b) V O(c, d). 
Thus 
O(a, b)\O(c, d) = O(s(c, d, a,b), b), 


so L is relatively complemented. 
Applying 8.14, we see that A € IT*SPy(K,). 
Note that if a variety V is such that V = JT*(K) then Vp; C I(K), where Vp; is the 
class of directly indecomposable members of V. 
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EXERCISES 89 


1. (a) Show that the variety of rings with identity generated by finitely many finite fields 
is a discriminator variety. (b) Show that the variety of rings generated by finitely many 
finite fields is a discriminator variety. 


2. If A is a Boolean product of an indexed family A,, « € X, of algebras with a common 
discriminator term, show that for each congruence @ on A there is a closed subset Y 
of X such that 

6={{a,b) €eAXA:Y C [a = O]}, 


and hence for 6 a maximal congruence on A there is an x € X such that 


6 = {(a,b) € Ax A: a(x) = b(z)}. 


3. If Ay, Ay are two nonisomorphic algebras with A, < A», and with a common ternary 
discriminator term, show that there is an algebra in [*({A,, Ay}) which is not isomor- 
phic to an algebra of the form A,[B,]* x A2[Bo]*. 


The spectrum of a variety V, Spec (V), is {JA]: A € V, A is finite}. 


4. (Gratzer). For S a subset of the natural numbers, show that S is the spectrum of 
some variety iff1 € S andm,né€S>m-ne€S. (Hint: Find a suitable discriminator 
variety. | 


5. (Werner). Let R be a biregular ring, and for a € R let a* be the central idempotent 
which generates the same ideal as a. Show that the class of algebras (R,+,-,—,0,*) 
generates a discriminator variety, and hence deduce from 9.4 the Dauns-Hofmann the- 
orem in the example at the end of 88. 


810. Quasiprimal Algebras 


Perhaps the most successful generalization of the two-element Boolean algebra was intro- 
duced by Pixley in 1970. But before looking at this, we want to consider three remarkable 
results which will facilitate the study of these algebras. 


Lemma 10.1 (Fleischer). Let C be a subalgebra of A x B, where A,B are in a congruence- 
permutable variety V. Let A’ be the image of C under the first projection map a, and let B’ 
be the image of C under the second projection map 3. Then 


C = {(a,b) € A’ x B’: a’(a) = B'(b)} 


for some surjective homomorphisms a! : A' + D, 2’: B’ > D. 


192 IV Starting from Boolean Algebras ... 


Proor. Let 6 = keralc V ker(@fc, and let v be the natural map from C to C/6. Next, 
define 
a’: A'>C/6 


to be the homomorphism such that 

v=a'oalg 
and 

B':B’ =C/é 
to be such that 

v= f'oBlc. 
(See Figure 29.) Suppose c € C. Then 


c = (ac, Bc) € A’ x B’ 
and 


a'(ac) = ve 
= 8'(Be), 
c € {(a,b) € A’ x B’: a’(a) = B'(b)}. 


Conversely, if 
(a,b) € A’x B’ and a’(a) = f(b) 


let C1, co € C with 
a(ci) =a, B(co) =b. 
Then 
v(c,) = a’a(cy) 
= a'(a) 
= 3"(b) 
= B'(Bc2) 
= v(c2), 
SO 
(C1, C2) € 8; 


hence (c1, C2) € kera o ker 3 as C has permutable congruences. Choose c € C' such that 


c1(ker a)c(ker 3)c2. 
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Then 
a(c) =a(c1) =a, 
Ble > (C2) = 6, 
SO 
c = (a,b); 
hence 
(a,b) EC. 


This proves 


AXB 


C/0 
Figure 29 


Corollary 10.2 (Foster-Pixley). Let $;,...,S, be simple algebras in a congruence-permutable 
variety V. If 


C<8,x---xS, 


is a subdirect product, then 


for some {i1,..., ix} C {1,..., nm}. 
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PROOF. Certainly the result is true ifn = 1. So suppose m > 1 and the result is true for all 
n <m. Then C is isomorphic in an obvious way to a subalgebra C* of (81 x-- +X Sm—1) X Sin- 
Let 


A=S, X--- X Sm-1, 
B=5,: 

Let 

a’: A'’—D, 

2':B' =D 
be as in 10.1. (Of course B’ = B.) As 2’ is surjective and B’ is simple, it follows that D is 
simple. 

If D is nontrivial, then (’ is an isomorphism. In this case 
C* = {(a,b) € A’ x B’: aa = 3'b} 
implies 
C* = {(a, 8’ "a’a) :a€ A’t, 


sO 
A'=C 


under the map 


at (a, B’"a'a) 


(just use the fact that @’~'a’ is a homomorphism from A’ to B’), and hence C & A’. As 
DE OG sae Gs 


is a subdirect product, then the induction hypothesis implies C is isomorphic to a product 
of some of the §;, 1 <i <™m. 
The other case to consider is that in which D is trivial. But then 


C* = {(a,b) € A’ x B': a’a = G'b} 
=A < B’ 
so 
C2A’xB. 
As A’ is isomorphic to some product of the S; and B’ is isomorphic to S,,, we have C 
isomorphic to a product of suitable S,’s. 


Definition 10.3. Let f be a function from A” — A. Define f on A? by 
f({aa, bi), ay (An, bn)) ms (f(a, ear) Qn), f(b1, ae) bn))- 
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For an algebra A we say f preserves subalgebras of A? if, for any B < A?, 
f(B") CB, 
i.e., B is closed under f. 


Lemma 10.4 (Baker-Pixley). Let A be a finite algebra of type F with a majority term 
M(a,y,z). Then for any function 


f:A° A, mee 


which preserves subalgebras of A? there is a term p(21,...,2n) of type F representing f on 
A. 


PROOF. First note that for B < A we have 
f(B") CB 


as 


C= {(b,b): be B} 


is a subuniverse of A?; hence 


f(O") CC, 
i.e., if we are given b),...,b, € B there is a b € B such that 


f((b1, 01), teey (On; bn)) a (6, b). 


But then 
FO ada) = bs 
Thus given any n-tuple (a),...,@,) € A” we can find a term p with 
DiGi kee) Oy) = fins tas Ge) 
as 


F(ai,.--,@n) € Sg({ai,..-,an}) 
(see 11§10.3). Also given any two elements 


COs rex ell PENDS ss scle Dey E A”, 


we have 
f(a, bi), Pee (Qn, bn)) = Se({(a1, bi), raat ay (Qn; OnE) 


hence there is a term q with 


q({a1, bi), pasar) (Qn, bn) = f((a, bi), ee) (Qn, Bn); 
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so 
Giga. i On) = 7 (Gigvasg an) 
and 


q(bi, Oe «4 Dy) = f(b, ake m0 I 


Now suppose that for every k elements of A”, k > 2, we can find a term function p which 
agrees with f on those k elements. If k 4 |A|", let S be a set of k+1 elements of A”. Choose 
three distinct members (aj,...,@n), (b1,---,0n), (C1,.--,€n) of S, and then choose terms 
Pi; P2, p3 Such that p; agrees with f on the set S — {(a1,...,@n)}, etc. Let 


Dl Pia cntes dpe SADT a eg yo ide oe eS kn os a) 


Since for any member of S at least two of p1,p2,p3 agree with f, it follows that p agrees 
with f on S. By iterating this procedure we are able to construct a term which agrees with 
f everywhere. 


Definition 10.5. An algebra S is hereditarily simple if every subalgebra is simple. 
Definition 10.6. A finite algebra A with a discriminator term is said to be quasiprimal. 


Theorem 10.7 (Pixley). A finite algebra A is quasiprimal iff V(A) is arithmetical and A 
is hereditarily simple. 


PRooF. (=) In §9 we verified that if A has a discriminator term then A is hereditarily 
simple and V(A) is arithmetical. 

(<=) Let t : A? > A be the discriminator function on A. Since V(A) is arithmetical, 
it suffices by 10.4 and II§12.5 to show that ¢ preserves subalgebras of A?. So let C be a 
subalgebra of A?. Let A’ be the image of C under the first projection map, and A” the 
image of C under the second projection map. By 10.1 there is an algebra D and surjective 
homomorphisms 


a’: A'’—D, 

3’: A" =D 
such that 

C = {{a',a") € A’ x A": a'd' = p'a"}. 
As A is hereditarily simple, it follows that either a’ and (@’ are both isomorphisms, or D is 
trivial. In the first case 
C= {(a', B’*a’a’\ : a’ € A'S, 

and in the second case 

C=A'x A". 
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Now let 
(a’,a’’, (b',b"', (e. cn) Ec A? 


and let C be the subuniverse of A? generated by these three elements. If C is of the form 
{(a',ya') : a’ € A} 


for some isomorphism 


y: A’ A” 
(y was 3’~1a’ above), then 
(aa) wb b> ate ag) 08 


hence 


t({a’,a"), (0, bY), (c,e")) = (t(a’, BY, c’), t(a", BY, c")) 


and in either case it belongs to C. If C’ is 
A’ x A” 


then as 
t((a’, ae, (D', ine es c"\) E {(a’, as), (a’, ey, (c, C. (c, a’\} ie C 


we see that this, combined with the previous sentence, shows t preserves subalgebras of A?. 


Corollary 10.8 (Foster-Pixley). For a finite algebra A the following are equivalent: 


(a) A is primal, 

(b) V(A) is arithmetical and A is simple with no subalgebras except itself, and the only 
automorphism of A is the identity map, and 

(c) A is quasiprimal and A has only one subalgebra (itself) and only one automorphism (the 
identity map). 


PrRooF. (a = b) If A is primal then there is a discriminator term for A so V(A) is 
arithmetical and A is simple by §9. As all unary functions on A are represented by terms, 
A has no subalgebras except A, and only one automorphism. 

(b = c) This is immediate from 10.7. 

(c = a) A? can have only A? and {(a,a) : a € A} as subuniverses in view of the details 
of the proof of 10.7. Thus for f : A” — A, n > 1, it is clear that f preserves subalgebras of 
A’. By 10.4, f is representable by a term p, so A is primal. 
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EXAMPLES. (1) The ring Z/(p) = (Z/(p),+,-,—,0,1) is primal for p a prime number 
as Z/(p) = {1,14 1,...}; hence Z/(p) has no subalgebras except itself, and only one 
automorphism. A discriminator term is given by 


U(x, y, Z) = (x = yy a+ [1 = (x = y)P*) 2%. 


(2) The Post algebraP,, = ({0,1,...,n—1}, V,A,‘, 0, 1) is primal as P, = {0,0',...,0-)}, 
where a) means k applications of ' to a; hence P,, has no subalgebras except P,,, and no au- 
tomorphisms except the identity map. For the discriminator term we can proceed as follows. 
For a,b € Py, 


BK NOY 0: Gk a Sb 


1<k<n 


! 
eee 1 ifaF0. 


i) 
g(a, b) = \ ( \ ava) 


1<j<n-1 \W<k<n 


Thus let 


/ 


Then 


Now we can let 
t(x,y, 2) = lg(, 9) Az] Y [g(g(@, 9), A 2). 
It is fairly safe to wager that the reader will think that quasiprimal algebras are highly 
specialized and rare—however Murskii proved in (6) below that almost all finite algebras are 
quasiprimal. 
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EXERCISES 810 


1. Show that one cannot replace the “congruence-permutable” hypothesis of 10.1 by 
“congruence-distributive”. [It suffices to choose C to be a three-element lattice.] 


2. Show that every finite subdirect power of the alternating group As; is isomorphic to a 
direct power of As. 


3. If V is a congruence-permutable variety such that every subdirectly irreducible algebra 
is simple, show that every finite algebra in V is isomorphic to a direct product of simple 
algebras. 


4. (Pixley). Show that a finite algebra A is quasiprimal iff every n-ary function, n > 1, 
on A which preserves the subuniverses of A? consisting of the isomorphisms between 
subalgebras of A can be represented by a term. 


5. (Quackenbush). An algebra A is demi-semi-primal if it is quasiprimal and each iso- 
morphism between nontrivial subalgebras of A can be extended to an automorphism 
of A. Show that a finite algebra A is demi-semi-primal iff every n-ary function, n > 1, 
on A which preserves the subalgebras of A and the subuniverses of A? consisting of 
the automorphisms of A can be represented by a term. 


6. (Foster-Pixley). An algebra A is semiprimal if it is quasiprimal with distinct nontrivial 
subalgebras being nonisomorphic, and no subalgebra of A has a proper automorphism. 
Show that a finite algebra A is semiprimal iff every n-ary function, n > 1, on A which 
preserves the subalgebras of A can be represented by a term. 


811. Functionally Complete Algebras and 
Skew-free Algebras 


A natural generalization of primal algebras would be to consider those finite algebras A such 
that every finitary function on A could be represented by a polynomial (see I1§13.3). Given 
an algebra A of type F, recall the definition of F¥4 and Ay given in 1813.3. 


Definition 11.1. A finite algebra A is functionally complete if A, is primal, i.e., if every 
finitary function on A is representable by a polynomial. 


In this section we will prove Werner’s remarkable characterization of functionally com- 
plete algebras A, given that V(A) is congruence-permutable. 


Definition 11.2. Let 2, denote the two-element distributive lattice (2,V,/A) where 2 = 
{0,1} and 0 <1. 


200 IV Starting from Boolean Algebras ... 


Lemma 11.3. Let S be a finite simple algebra such that V(S) is congruence-permutable and 
Con(S%) = 27 
forn<w. Then § is functionally complete. 


PROOF. For brevity let F denote Fy s,)(Z, 9, Z). From 1811.10 it follows that F € 1S P(Ss). 
As Sg has no proper subalgebras, F is subdirectly embeddable in S% for some k. Then from 
10.2, we have 
F2S% 
for some n, so by hypothesis 
Con(F) = 2%. 

Thus Con F is distributive, so by II§12.7, V(Ss) is congruence-distributive. Since V(S) 
is congruence-permutable so is V(Sg) (just use the same Mal’cev term for permutability); 


hence V(Ss) is arithmetical. As Ss has only one automorphism, we see from 10.8 that Ss 
is primal, so S is functionally complete. 


The rest of this section is devoted to improving the formulation of 11.3. 
Definition 11.4. Let 0; € Con Aj, 1 <i<n. The product congruence 
6, X +++ X On 
on A, x --- x A, is defined by 
((@1,--+5@n), (b1,.--,0n)) € 01 X +++ X On 


iff 
(ai, b;) E 0; for 1 < 1 < nN. 


(We leave the verification that 6, x --- x 6, is a congruence on A; x --- x A,, to the reader.) 
Definition 11.5. A subdirect product 

B<B,x.::--xB, 
of finitely many algebras is skew-free if all the congruences on B are of the form 

(0, X-++ xO) NB, 


where 6; € Con B,, i.e., the congruences on B are precisely the restrictions of the product 
congruences on B, x --- x By to B. A finite set of algebras {Aj,...,A,} is totally skew-free 
if every subdirect product 

B< B, x-:-x B;, 
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is skew-free, where B; € {Aj,..., An}. 
Lemma 11.6. The subdirect product 
B<B,x.:-:--xB; 


is skew-free iff 
0 = (8V pi) 1---N (AV px) 


for 8 € Con B, where 
pi = (ker 7) NB? 


and 7; 18 the ith projection map on By, x +--+: xX Bg. 
PROOF. (=) Given B skew-free let 6 € Con B. Then 
G= (0; x + x O,) OB? 


for suitable 0; € Con B;. Let 
y;: B— B/(@V pi) 
be the canonical homomorphism, and let 
be the ith projection of B,; x --- x Bx, restricted to B. Then as 
ker @; = pj COV pj = ker y; 
there is a homomorphism 
such that 
y= Q;T;. 


Now for a,b € B we have 


(a,b) €OV p; iff v;(a) = 1;(b) 
iff ajm;(a) = aym;(b) 
iff a,a; = a,b; 
iff (a;,b;) € ker a;; 
hence 
OV pp =(V X--+ x keray xX --» x V) NB’. 


Also since 
(a,b) € pi > a; = 0; 
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it is clear that 
(a,b) COV pi => (ai, bi) € 4; 


hence 
ker a; C 9). 
Thus 
OV pp C(V XX OX + X VIO B?, 
and then 
OC (OV pi) N--- (AV px) 
C(,xX VK XVI AV Xe XV KO) AB? 


so the first half of the theorem is proved. 
(<) For this direction just note that the above assertion 


OV pp =(V X-++ xX kera;y x +++ x V)NB?, 
for 8 € Con B, does not depend on the skew-free property. Thus 


0= (OV pi)N---N(8V px) 
= (keray xX VX) x V) A= A(V x ++ x VX keraz) 9 B? 
= (ker a, x «++ x ker ag) N B’, 


so @ is the restriction of a product congruence. 


Now we can finish off the technical lemmas concerning the congruences in the abstract 
setting of lattice theory. 


Lemma 11.7. Suppose L is a modular lattice with a largest element 1. Also suppose that 
a1, a2 € L have the property: 


c € [a1 Aag, 1] > c= (CV a) A (CV ag). 
Then for any be€ L, 


c € [a; Aaz A b,b] > c= (cV (a, AD)) A (CV (a2 A9)). 
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PROOF. Let c € [a; A a2 A b, 6]. Then 


c=cV(bAa A az) 
= b/A (cV (a; A ay)) 
=b/A(cV ay) A (CV ag) 
= [cV (a1 Ab) A [cV (a2 A 8) 


follows from the modular law and our hypotheses. 


Lemma 11.8. Let L be a modular lattice with a largest element 1. Then if ay,...,@n € L 
have the property 
c € [a; Aa;,1] > c= (eV a) A (CV a;), 


ea <1, tien 


c € [a1 A+++ Aan, 1] > C= (CV a1) A--+ A (CV Gn). 


PROOF. Clearly the lemma holds if n < 2. So let us suppose it holds for all n < m, where 
m > 3. Then for c € [a1 A- ++ A am, 1], 
c=cV (a, Ac) 
=cV {[(ai Ac) V (a, A az2)] A-+- A [(a1 Ac) V (a1 A am) ]}- (x) 
This last equation follows by replacing L by the sublattice of elements x of L such that 


x < a,, and noting that a, A dg,...,a, A Gm satisfy the hypothesis of 11.8 in view of 11.7. 
By the induction hypothesis we have for this sublattice 


a, \c = [(ay Ac) V (a, A a2)] A+++ A [(Q AC) V (a1 A am)). 
Now applying the modular law and the hypotheses to (*) we have 


c=cV {a A [eV (a1 A a2)] A+++ A [eV (a1 A am) } 
=cV {ai A [(cV a1) A (CV aa)} A+++ A [(C V ar) A (CV Gm) $ 
= (cV aj) A---A(CV am). 


This finishes the induction step. 


Lemma 11.9. Let {Aj,...,A,} be a set of algebras in a congruence-modular variety such 
that for any subdirect product D of any two (not necessarily distinct) members, say D < A; x 
A,, the only congruences on D are restrictions of product congruences. Then {Ay,..., An} 
is totally skew-free. 
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PROOF. Let 

B<B,x.::--xB, 
be a subdirect product of members of {Aj,...,A,}, and let 

pi = (ker 7) N B? 


as before. For 1 <i <j < k, B/(p; p;) is isomorphic to a subalgebra of B; x B;, which 
is a subdirect product of B; x B,; obtained by using a projection map on B. From this and 
the correspondence theorem it follows that if 6 € Con B, then 


pip; Ce 


implies 
6 = (OV p)N OV py) 


by our assumption on D above and 11.6. Now we can invoke 11.8, noting that B x B is the 
largest element of Con B, to show that, for 6 € Con B, 


0= (OV pi) N--- (OV pr) 


because p,M---M px is the smallest congruence of B. By 11.6, {Ai,..., A, } must be totally 
skew-free. 


Lemma 11.10. Suppose Aj,...,An belong to a congruence-distributive variety. Then 
{Aj,..., An} is totally skew-free. 


PROOF. For any subdirect product 
B< B,x.:-- x Br, 


where B,,...,B, belong to a congruence-distributive variety, let p; be as defined in 11.6. 
Then for 6 € Con B, 


6=O0VA=OV (p, A---A px) 
= (0V pi) A+++ A (OY pr), 


so B is skew-free by 11.6. Hence {A,,...,A,} is totally skew-free. 


Lemma 11.11. Let P be a nontrivial primal algebra. Then 


Con P? = 25. 


Proor. As V(P) is congruence-distributive, the congruences of P? are precisely the product 
congruences 6; x 62 by 11.10. As P is simple, Con P? is isomorphic to 2%. 
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Theorem 11.12 (Werner). Let A be a nontrivial finite algebra such that V(A) is congruence- 
permutable. Then A is functionally complete iff Con A? & 22. 


PROOF. (=) Suppose A is functionally complete. Note that 
Con A” = Con A‘, 
(adding constants does not affect the congruences). As A, is primal, we have by 11.11, 
Con A? & 2%. 


(<=) As 
Con A? & 27 


again 
Con A’ = 27. 


Thus A4 must be simple (otherwise there would be other product congruences on A), and 
having the constants of A ensures Ay has no proper subalgebras and no proper automor- 
phisms. A (now familiar) application of Fleischer’s lemma shows that the only subdirect 
powers contained in Ay x A, are A%, and D, where 


D = {(a,a):a€ A}. 


The congruences on A% are product congruences since there are at least four product con- 
gruences A x A, A x V,V x A, V x V, and from above 


mm n 
Con A‘, = 27. 


The congruences on D are (V x V) ND? and (A x A)N D? as D & Ay. Thus by 11.9, {Ay} 
is totally skew-free. Consequently, 


mm n 
Con A" = 27, 


so A is functionally complete by 11.3. 


Corollary 11.13 (Maurer-Rhodes). A finite group G is functionally complete iff G is 
nonabelian and simple or G is trivial. 


PROOF. The variety of groups is congruence-permutable; hence congruence-modular. If 
Con G? = 27 


then G is simple. 
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The nontrivial simple abelian groups are of the form Z/(p); and 


|Con(Z/(p) x Z/(p))| > 4 


as 
{(a,a) :a € Z/(p)} 
is a normal subgroup of Z/(p) x Z/(p), so Z/(p) cannot be functionally complete. Hence G 
is nonabelian and simple. 
If G is nonabelian simple and N is a normal subgroup of G?, suppose (a,b) € N with 
a # 1. Choose c € G such that 
cac! # a. 


Then 
(cac™*, b) = (c,b){a, b)(c7*, b7) EN; 


hence 
(cae *a>*, 1) =(eae*, b}(a*,B"") € N. 


As G is simple, it follows that 
(cae *a*, 1) 


generates the normal subgroup ker 72 since 
cac ta! £1, 
so 
ker 72 C N. 


Similarly, 
b#A1>kerm CN. 


If both a,b 4 1, then 
ker 771, ker t2 C N 


implies 
GaN. 


Thus G? has only four normal subgroups, so 


Con G? ~ 23. 


This finishes the proof that G is functionally complete. 
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EXERCISES §11 


1. If A is a finite algebra belonging to an arithmetical variety, show that A is functionally 
complete iff A is simple. 


2. If R,, Re are rings with identity, show that R, x R» is skew-free. Does this hold if we 
do not require an identity? 


3. Describe all functionally complete rings with identity. 

4. Describe all functionally complete lattices. 

5. Describe all functionally complete Heyting algebras. 

6. Describe all functionally complete semilattices. 

7. Show the seven-element Steiner quasigroup is functionally complete. 


8. (Day) Show that a finitely generated congruence-distributive variety has the CEP iff 
each subdirectly irreducible member has the CEP. 


§12. Semisimple Varieties 


Every nontrivial Boolean algebra is isomorphic to a subdirect power of the simple two-element 
algebra, and in 9.4 we proved that every algebra in a discriminator variety is isomorphic to 
a subdirect product of simple algebras. We can generalize this in the following manner. 


Definition 12.1. An algebra is semisimple if it is isomorphic to a subdirect product of 
simple algebras. A variety V is semisimple if every member of V is semisimple. 


Lemma 12.2. A variety V is semisimple iff every subdirectly irreducible member of V is 
simple. 


PROOF. (=) Let A be a subdirectly irreducible member of V. Then A can be subdirectly 
embedded in a product of simple algebras, say by 


a:A-> I] S;. 
ie] 
As A is subdirectly irreducible, there is a projection map 


m:][Si- Si 


tel 


208 IV Starting from Boolean Algebras ... 


such that 7; 0a is an isomorphism. Thus 
A= Sas 


so A is simple. 
(<) For this direction use the fact that every algebra is isomorphic to a subdirect product 
of subdirectly irreducible algebras. 


Definition 12.3. Let A be an algebra and let 0 € Con A. In the proof of II85.5 we showed 
that 6 is a subuniverse of A x A. Let @ denote the subalgebra of A x A with universe 06. 


Lemma 12.4 (Burris). Let A be a nonsimple directly indecomposable algebra in a congruence- 
distributive variety. If@ € Con A is maximal or the smallest congruence above A, then @ is 
directly indecomposable. 


PROOF. We have 
6<AxA. 


By 11.10, @ is skew-free. Thus suppose 
(1 x 62) N 6" 
and 
(dX 3) NO 
are a pair of factor congruences on 6, where ¢;,¢* € Con A, i = 1,2. From 


[(¢1 x b2) 147] © [(d] x 63) N67] = Vo 


it follows that 
i o On = Va; 
i = 1,2. To see this let a,b € A. Then 


((a, a), (b, b)) € 0, 
so for some c,d € A, 
(a, a)[(1 x $2) NO*](e, d)[(G7 x $3) 1.47] (0, 6). 
Thus 


ag,co;), 
ad2d 5b. 
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Next, from 
[(o1 x G2) 167] N [G7 x $3) 167] = Ag 


it follows that 
PNG, NO=Ay 


for 7 = 1,2. To see this, suppose 
(a, b) € Pr Ng NA, 
with a £ b. Then 


((a, b), (b, by) € [(d1 x G2) NO?) [(b7 x 63) NF], 
which is impossible as 
(a,b) # (b, b). 


Likewise, we show 
021 65N8 = Ag. 


Suppose @ is a maximal congruence on A. If 
BNO; FAa 
for 2 = 1,2, then 
AV (biN¢;)=Va 
as 
bi; ZO; 
and 


so #1; is the complement of 6 in Con A,2 = 1, 2. In distributive lattices complements are 
unique, so 


011 GO} = 6229 5. 
Then choose (a,b) € 6, ¢{ with a ¥ b. This leads to 


((a, a), (b, 6)) € [(d1 x $2) NO] A[(G7 x 63) 1", 


which is impossible as 
(a, a) # (b, b). 


Now we can assume without loss of generality that 


og; = Aa. 
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Thus, by the above, ¢1, ¢] is a pair of factor congruences on A. As A is directly indecom- 
posable, we must have 


{o1, Oj} = {Aa, Va}, 


say 
on = Va; ?} = Aa. 
Then 
(d% x 63) = (Ag x 6) NOP 

= [Ay x (¢3N AN; 

hence 
b2 0 (30) = Va. 

As 


$21 (9,98) =Aa 
and A is directly indecomposable we must have 
03N0 = Aa, 
so 


(¢3 x 63)? = (Ag x Ag) N? 
= Ag. 


This shows that 6 has only one pair of factor congruences, namely 
{Ao, Vot; 


hence @ is directly indecomposable. 
Next suppose @ is the smallest congruence in Con A — {Ay}. Then 


ON (O:1¢;) =Axs 
immediately gives 
biG; = Aa, 
so we must have 
{$i 0} = {Aa, Va} 
as 
i909; = Va, 
t= 129i 
bi F 2, 
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say 


on = VA; o>) = INA; 


then 
(o1 x do) NF = (6x Ay) n&, 


which implies 
(d7 x $5) N@? = (Ag x 0) N6?. 


But if (a,b) ¢ @ then 
((a, a), (b, b)) ¢ (8x Aa) NO 0 (Ay x ON G?, 


so we do not have factor congruences. Hence necessarily 


oi = $2, 

o} = 3, 
and this leads to the factor congruences 

{Vo, Ap}, 


so @ is directly indecomposable. 


Theorem 12.5 (Burris). [f V is a congruence-distributive variety such that every directly 
indecomposable member is subdirectly irreducible, then V is semisimple. 


PROOF. Suppose A € V where A is a nonsimple subdirectly irreducible algebra. Let @ be 
the least congruence in Con A—{A}. Note that 6 4 V4. Then @ is a directly indecomposable 
member of V which is not subdirectly irreducible (as 


P.O po = Xo 
where 
pi = (ker 7) N 6”, 
™m%:AxA—-A, 
ea a 
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EXERCISES §12 


1. Let V be a finitely generated congruence-distributive variety such that every directly 
indecomposable is subdirectly irreducible. Prove that V is semisimple arithmetical. 


2. Give an example of a finitely generated semisimple congruence-distributive variety 
which is not arithmetical. 


3. Given A as in 12.4 can one conclude for any congruence @ such that A < @ < V that 
9 is directly indecomposable? 


4. Given A,6 as in 12.4 and B a subuniverse of Su(J/) let A[B,6]* be the subalgebra of 
A! with universe {f € A! : f-t(a) € B, f-'(a)/0 € {9,I}, for a € A, |f(A)| < wt}. 
Show that A[B, 6]* is directly indecomposable. 


813. Directly Representable Varieties 


One of the most striking features of the variety of Boolean algebras is the fact that, up to 
isomorphism, there is only one nontrivial directly indecomposable member, namely 2 (see 
Corollary 1.9). From this we have a detailed classification of the finite Boolean algebras. A 
natural generalization is the following. 


Definition 13.1. A variety V is directly representable if it is finitely generated and has (up 
to isomorphism) only finitely many finite directly indecomposable members. 


After special cases of directly representable varieties had been investigated by Taylor, 
Quackenbush, Clark and Krauss, and McKenzie in the mid-1970’s, a remarkable analysis 
was made by McKenzie in late 1979. Most of this section is based on his work. 


Lemma 13.2 (Pélya). Let c1,...,c; be a finite sequence of natural numbers such that not 
all are equal to the same number. Then the sequence 


SSE eae, Me Us 


has the property that the set of prime numbers p for which one can find an n such that p 
divides 8, 1s infinite. 


PROOF. Suppose that c;,...,c; is such a sequence and that the only primes p such that p 
divides at least one of {s,:n > 1} are pi,...,p,. Without loss of generality we can assume 
that the greatest common divisor of c,,...,¢ is 1. 


Claim. For p a prime and forn >1,k > 1,t < p**}, 


—1)p*-n —1)p*-n 
pit 1 cl? 1)p ai oi te cl? 1)p 
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To see this, note that from Euler’s Theorem we have 


and furthermore 


p | c => c = 0 (mod p) 
=> cht! = 0 (mod p*t!) 
o(pkt") — 0 


: Py. 


=e (mod p 


Let uw be the number of integers 7 € [1, t] such that pf c, ie., cf) = 1 (mod p**"). Then 
u>lasg.c.d.(c,...,c¢) =1. Furthermore, for n > 1, 


GOP ep GOTT =u ( 


CG Ny 


mod p 


Since 1 <u <t < p**1, p*t"+4u, and hence the claim is proved. 
Now if we set 


m = (pit!) ---d(pe**) 


then for n > 1,1 < j <r,t < p**!, the claim implies 


k+1 mn mn 
PRE eR bob em, 


SO 
See pett a eget 

aS p1,...,pr are the only possible prime divisors of Sm,. Thus the sequence (Simn)n>1 is 

bounded. But this can happen only if aj = --- = a; = 1, which is a contradiction. 


Definition 13.3. A congruence @ on A is uniform if for every a,b € A, 
|a/0| = |b/9|. 
An algebra A is congruence-uniform if every congruence on A is uniform. 


Theorem 13.4 (McKenzie). If V is a directly representable variety, then every finite member 
of V is congruence-uniform. 


ProoF. If V is directly representable, then there exist (up to isomorphism) finitely many 
finite algebras D,,...,D, of V which are directly indecomposable; hence every finite member 
of V is isomorphic to some Dj"! x--- x D7". Thus there are only finitely many prime numbers 
p such that p||Al for some finite A € V. 
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Now if A is a finite member of V which is not congruence-uniform, choose 6 € Con A 
such that for some a,b € A,|a/0| 4 |b/6|. For n > 1, let B, be the subalgebra of A” whose 
universe is given by 

B, = {a € A”: a(t)Oa(j) for 0 <i, 7 < n}. 


Let the cosets of 6 be S),...,5; and have sizes c,,...,c respectively. Then 
Bal = eh ton te 


hence by Polya’s lemma there are infinitely many primes p such that for some B,, p| |B,|. As 
B, € SP(A) CV this is impossible. Thus every finite member of V is congruence-uniform. 


Lemma 13.5 (McKenzie). Jf A is a finite algebra such that each member of S(A x A) is 
congruence-uniform, then the congruences on A permute. 


PROOF. Given 6;, 42 € Con A, let B be the subalgebra of A x A whose universe is given by 
B= 0, ie) Ao. 
Let 
= 02 X Oa! B, 


a congruence on B. For a € A, 
a/O2 x a/O2 Cc Ay <= B; 


hence 
(a, a) / = a/02 x a/O2. 
Since A € IS(A x A), both #2 on A and ¢ on B are uniform congruences. If r is the size 


of cosets of 62 and s is the size of cosets of @, it follows that s = r?. Now for (a,b) € B, we 
have 


(a, b)/@ Ss a/O2 x b/02, 
|(a, b)/e| = , 
|a/O2| = |b/O2| =r, 


and s = r?; hence 
(a, b)/¢@= a/O2 x b/02. 


Now for c,d € A, 
(c, d) E A200; 0 0 0 Ay 


iff 
(c,d) € a/02 x b/@2 for some (a,b) € B, 
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sO 
§,00,06,00, C B= 0 Ay; 


hence 
9200, CO; 0 A, 


so the congruences on A permute. 


Theorem 13.6 (Clark-Krauss). Jf V is a locally finite variety all of whose finite algebras 
are congruence-uniform, then V is congruence-permutable. 


Proor. As Fy (Z, Y, Z) is finite, by 13.5 it has permutable congruences; hence V is congruence- 
permutable. 


Corollary 13.7 (McKenzie). If V is a directly representable variety, then V is congruence- 
permutable. 


PROOF. Just combine 13.4 and 13.6. 


Theorem 13.8 (Burris). Let V be a finitely generated congruence-distributive variety. Then 
V is directly representable iff V is semisimple arithmetical. 


Proor. (=) From 12.4 and 12.5, V is semisimple, and by 13.7 V is congruence-permutable. 
Hence V is semisimple arithmetical. 

(<) If V is semisimple arithmetical, then every finite subdirectly irreducible member of 
V is a simple algebra; hence every finite member of V is isomorphic to a subdirect product 
of finitely many simples. Then by 10.2 every finite member of V is isomorphic to a direct 
product of simple algebras. By 6.10 there are only finitely many simple members of V, so V 
is directly representable. 


Theorem 13.9 (McKenzie). [fV = IT°*(K), where K is a finite set of finite algebras, then 
V is congruence-permutable. 


PRooF. As every finite Boolean space is discrete, it follows that every finite member of V 
is in [P(K,); hence V is directly representable, so 13.7 applies. 


A definitive treatment of directly representable varieties is given in [1] below. 
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EXERCISES 813 
1. Which finitely generated varieties of Heyting algebras are directly representable? 
2. Which finitely generated varieties of lattices are directly representable? 
3. If G is a finite Abelian group, show that V(G) is directly representable. 


4. If R is a finite ring with identity, show that V(R) is directly representable if R is a 
product of fields. 


Chapter V 


Connections with Model Theory 


Since the 1950’s, a branch of logic called model theory has developed under the leadership 
of Tarski. Much of what is considered universal algebra can be regarded as an extensively 
developed fragment of model theory, just as field theory is part of ring theory. In this chapter 
we will look at several results in universal algebra which require some familiarity with model 
theory. The chapter is self-contained, so the reader need not have had any previous exposure 
to a basic course in logic. 


81. First-order Languages, First-order Structures, 
and Satisfaction 


Model theory has been primarily concerned with connections between first-order properties 
and first-order structures. First-order languages are very restrictive (when compared to 
English), and many interesting questions cannot be discussed using them. On the other 
hand, they have a precise grammar and there are beautiful results (such as the compactness 
theorem) connecting first-order properties and the structures which satisfy these properties. 


Definition 1.1. A (first-order) language & consists of a set of relation symbols and a set 
F of function symbols, and associated to each member of & [of F] is a natural number [a 
nonnegative integer] called the arity of the symbol. F, denotes the set of function symbols 
in F of arity n, and &,, denotes the set of relation symbols in ® of arity n. & is a language 
of algebras if R = S@, and it is a language of relational structures if F¥ = ©. 


Definition 1.2. If we are given a nonempty set A and a positive integer n, we say that r is 
an n-ary relation on A if r C A”. ris unary if n = 1, binary if n = 2, and ternary if n = 3. 
A relation is finitary if it is n-ary for some n, 1 <n < w. When r is a binary relation we 


frequently write arb for (a,b) € r. 


Definition 1.3. If £ is a first-order language, then a (first-order) structure of type & (or 
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£-structure) is an ordered pair A = (A, L) with A # ©, where L consists of a family R of 
fundamental relations r* on A indexed by & (with the arity of r“ equal to the arity of r, 
for r € R) and a family F of fundamental operations f* on A indexed by ¥ (with the arity 
of f* equal to the arity of f, for f € F). A is called the universe of A, and in practice we 
usually write just r for r“ and f for f4. If R = @ then A is an algebra; if F = @ then A 
is a relational structure. If & is finite, say F¥ = {fi,..., fm}, R = {r1,..., Tn}, then we often 
write 


Ag Fist de oel a hiyiete ta) 


instead of (A, L). 


EXAMPLES. (1) If £ = {+,-,<}, then the linearly ordered field of rationals (Q,+,-,<) isa 
structure of type £. 
(2) If £ = {<}, then a partially ordered set (P,<) is a relational structure of type £. 


Definition 1.4. If £ is a first-order language and X is a set (members of X are called 
variables), we define the terms of type & over X to be the terms of type F over X (see 
11811). The atomic formulas of type & over X are expressions of the form 


pq where p,q are terms of type £ over X 
r(pi,---;Pn) Wherer € R, and pj,...,pn are terms of type £ over X. 


EXAMPLE. For the language £ = {+,-,<} we see that 
(v-y)-zRa-y, (e-y)-zS 4-2 


are examples of atomic formulas, where of course we are writing binary functions and binary 
relations in the everyday manner, namely we write u-v for -(u,v), and u < v for < (u,v). If 
we were to rewrite the above atomic formulas using only the original definition of terms, we 
would have the expressions 


C(t,y),2) * (ey), SCC), 2), 2). 


Definition 1.5. Let £ be a first-order language and X a set of variables. The set of (first- 
order) formulas of type & (or £-formulas) over X, written £(X), is the smallest collection of 
strings of symbols from LUX U {(, )}U{&, V, 73,3, ,V, 4, =} U {, } containing the atomic 
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formulas of type £ over X, and such that if 6, 6;,@. € L(X) then 


The symbols & (and), V (or), — (not), — (implies), and < (iff) are called the propositional 
connectives. V is the universal quantifier, and J is the existential quantifier; we refer to them 
simply as quantifiers. p % q denotes — (p = q). 


EXAMPLE. With £ = {+,-,<} we see that 


(Va(r-yeyt+u)) > Gy@-y<y+u)) 


is in L({2,y, u}), but 
Va(a&y © u) 


does not belong to £({2, y, u}). 


Definition 1.6. A formula ®, is a subformula of a formula ©® if there is consecutive string 
of symbols in the formula ® which is precisely the formula ®1. 


EXAMPLE. The subformulas of 


(Va(r-yxy+u)) > Gy@-y<ytu)) 


are itself, 
Va(z-y@ytu), 
dy(v-y Sy tu), 
and 


e-ysytu. 


Remark. Note that the definition of subformula does not apply to the string of symbols 


(Va(r-yrytu)) > Gy@-ysytu)); 
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for clearly y © y is a consecutive string of symbols in this expression which gives a formula, 
but we would not want this to be a subformula. However if one translates the above into 
the formula 


then the subformulas, retranslated, are just those listed in the example above. 
Definition 1.7. A particular variable x may appear several times in the string of symbols 
which constitute a formula ®; each of these is called an occurrence of x. Similarly we may 


speak of occurrences of subformulas. Since strings are written linearly we can speak of the 
first occurrence, etc., reading from left to right. 


EXAMPLE. There are three occurrences of x in the formula 


(Va(r-yxytu)) > Gy@-ysy+u)). 


Definition 1.8. A particular occurrence of a variable x in a formula © is said to belong to 
an occurrence of a subformula ®, of ® if the occurrence of x is a component of the string of 
symbols which form the occurrence of ®;. An occurrence of x in © is free if x does not belong 
to any occurrence of a subformula of the form Vxz(W) or da(W). Otherwise, an occurrence of 
x is bound in ®. A variable x is free in ® if some occurrence of x is free in ®. To say that 
x is not free in ® we write simply x ¢ ®. A sentence is a formula with no free variables. 
When we write ®(2,...,2,) we will mean a formula all of whose free variables are among 
{21,...,2%n}. We find it convenient to express ®(21,...,%m,Y1,---5Yn)---) by O(@,9,...). If 
x; is free in ®(2,,...,2%,) then this notation is assumed to refer to all the free occurrences 
of x;. Thus, given a formula ®(21,...,2,), when we write 


O(x1, oo Ui-1,Y, Vj41,--- Sees) 
we mean the formula obtained by replacing all free occurrences of x; by y. 


EXAMPLE. Let ®(2,y,u) be the formula 


(Va(r-yeytu)) > Gy@-ysy+u)). 


The first two occurrences of x in ®(z,y,u) are bound, the third is free. ®(2,2,u) is the 
formula 


(Va(a-axa+u)) > Ay@-ysytu)). 


Definition 1.9. If A is a structure of type £, we let £4 denote the language obtained by 
adding a nullary function symbol a to £ for each a € A. Given ®(2,..., 2%) of type £4 and 
a € A, the formula 

O(x1, wee, Ui-1, 4, Vi41,..-.- ey) 
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is the formula obtained by replacing every free occurrence of x; by a. We sometimes refer to 
formulas of type £4 as formulas of type £ with parameters from A. 


When desirable we give ourselves the option of inserting or removing parentheses to 
improve readability, and sometimes we use brackets, |, ] and braces {, } instead of parentheses. 

Next we want to capture the intuitive understanding of what it means for a first-order 
formula to be true in a first-order structure. A precise definition of truth (i.e., definition of 
satisfaction) will allow us to do proofs by induction later on. From now on we will frequently 
drop parentheses. For example we will write 6; & 2 instead of (®;) & (2), and Vrdy® 
instead of Vx(Sy(®)); but we would not write 6; & 6) V ®3 for (®;) & (®2 V ®3). 


Definition 1.10. Let A be a structure of type £. For sentences ® in £4(X) we define the 
notion A — ® (read: “A satisfies ®” or “@ is true in A” or “® holds in A”) recursively as 
follows: 


(i) if ® is atomic: 


(4) A pe sowie) Coons er (age Sas) a es og) 


(b) AE r(ay,...,@n) iff r4(a1,...,G@n) holds in A 


(ii) AE ® & gd iff AF ®, and AF ®, 


(iii) AE ©, V ® iff AE ®, or AF ®, 


(iv) AE -® iff it is not the case that A — ® (which we abbreviate to: A |- ®) 


(v) Af ®, > 9, iff A - ® or AF 


(vi) AF ®, © ®, iff (A A &, and A F ®,) or (A F ®, and A F ®,) 


(vii) A — Va®(x) iff A — ®(a) for everya Ee A 


(viii) A — dr®(x) iff A — ®(a) for somea € A. 


For a formula ® € £La(X) we say 


AE ® 
iff 
ASN Gis VG, 
where 21,...,p are the free variables of ®. For a class K of £-structures and ® € L(X) we 
say 


KE® iff AE® forevery Ac K, 


ae 


and for © a set of £-formulas 


AED 
KES 
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iff AE® forevery BED 
iff KE® forevery BED. 


(If A — © we also say A is a model of J.) Then we say 
“SE® iff AE implies AE®, for every A, 
(read: “XI yields ®”), and 
“SEY iff “E® for every ® € Yy. 


EXAMPLE. A graph is a structure (A,r) where r is a binary relation which is irreflexive and 
symmetric, i.e., for a,b € A we do not have r(a, a), and if r(a, b) holds so does r(b, a). Graphs 
are particularly nice to work with because of the possibility of drawing numerous examples. 
Let A = (A,r) be the graph in Figure 30, where an edge between two points means they 
are related by r. Let us find out if 


A 


= Very arta, z) V ras 2) 


This sentence will be true in A iff the following four assertions hold: 


(i) AF Syv2(r(a, z) V rly, 2)) 
(ii) A E dyVz(r(0, z) V r(y, z)) 
(iii) A F dyVz(r(c, z) V r(y, z)) 
(iv) A = dyVz(r(d, z) V r(y, z)). 
Let us examine (i). It will hold iff one of the following holds: 


(ig) A = V2(r(a, z) V r(a, z)) 

(ip) A F Vz(r(a, z) V r(8, z)) 

(i.) A FE Vz(r(a, z) V r(c, z)) 

(iq) A — Vz(r(a, z) V r(d, z)). 
(i 


(ing) A F r(a,a) V r(b,a) 
(inn) A E r(a, b) V r(b, b) 
(ine) A FE r(a,c) V r(b,c) 
(ing) A F r(a,d) V r(b, d). 
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(ip) is true, hence (i) holds. Likewise, the reader can verify that (ii), (iii), and (iv) hold. 
But this means the graph A satisfies the original sentence. 
It is useful to be able to work with sentences in some sort of normal form. 


d 
a b . 
Cc 
Figure 30 
Definition 1.11. Let ®)(x,...,x7,) and ®o(21,...,2,) be two formulas in £(X). We say 


that ©; and ® are logically equivalent, written ®, ~ ®o, if for every structure A of type £ 


and every @1,...,@n € A we have 


A ES ®1 (a1, ase og les) iff AE ®2(a1, adc gar ) 


If for all £-structures A, A — ®, where ® is an £-formula, we write 


= . 


The reader will readily recognize the logical equivalence of the following pairs of formulas. 


Lemma 1.12. Suppose ®,®,,®2 and ®3 are formulas in some £(X). Then the following 
pairs of formulas are logically equivalent: 


G&D ® idempotent 
OVO ® laws 

©, & Bo ®, & commutative 
©, VO, OV laws 


®, & (@, & &;) 
®, V (By V 3) 
®, & (@) V 43) 


(D; V By) V B3 laws 


(@; & &y) V (®; & Bs) distributive 


D, V (By & 3) 
4 (; & ®) 

7 (, V ®2) 

®, — ®, 

®, — Po 

a3. @ 


(Db; V By) & (®; V ®3) 
(4 ®1) V (4@s) 

(41) & (4) 

(bd; — ®2) & (®2 > ,) 
(4 ®,) V By 

® 


(®, & y) & 5 associative 


laws 
de Morgan 


laws 
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PROOF. (Exercise.) 


The next list of equivalent formulas, involving quantifiers, may not be so familiar to the 
reader. 


Lemma 1.13. Jf ®,®,; and ®, are formulas in some L(X), then the following pairs of 
formulas are logically equivalent: 


Va(®, & By) =(Vr®,) & (Vro) 
( 


dx(b, V by) 4r®,) V (ar) 
Va® ® ifr do 
2 ® ifxé® 
Va(® V ®2) (Vr®,) V Bp ifr € ®, 
dx(®; & By) (drb,) & ifr € ®, 
aVx®(z) dr 4 O(z) 
a dar@(x) Va 3 O(2r) 
Va(®; > &2)  &, > (Vrs) ifx EZ ®, 
dx(®, > b2) , — (Ar) ifr Z , 
Va(®, > 2) = (ar@,) — 2 ifa Z By 
dx(@; + B®.) (Vr®)) > b, ifx ¢ ® 
Vr®(x) Vy®(y) peace replacing all free occurrences 
Jr® (x) Ay®(y) of x in B(x) by y does not lead 


to any new bound occurrences of y. 


ProoF. All of these are immediate consequences of the definition of satisfaction. In the 
last two cases let us point out what happens if one does not heed the “provided...” clause. 
Consider the formula ®(x) given by Jy(x % y). Replacing x by y gives dy(y % y). Now 
the sentence Vrdy(x % y) is true in any structure A with at least two elements, whereas 
Vary(y % y) is logically equivalent to dy(y # y), which is never true. 


Definition 1.14. If 6 € L(X) we define the length I(®) of ® to be the number of occurrences 
of the symbols &, V, 7, >,<,V, and J in ®. 


Note that [(®) =0 iff ©® is atomic. 


Lemma 1.15. Jf ®, 1s a subformula of ® and ®, 1s logically equivalent to ®2, then replacing 
an occurrence of ®, by ®2 gives a formula ®* which is logically equivalent to ®. 
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ProoF. We proceed by induction on 1(®). 

If (©) = 0 then ® is atomic, so the only subformula of ® is © itself, and the lemma 
is obvious in this case. So suppose /(@) > 1 and for any WV such that [(V) < 1(®) the 
replacement of an occurrence of a subformula of V by a logically equivalent formula leads to 
a formula which is logically equivalent to V. Let ®; be a subformula of ® and suppose ®, is 
logically equivalent to ®j. The case in which ©, = © is trivial, so we assume /(®,) < I(®). 
There are now seven cases to consider. Suppose ® is ®’ & ®”. Then the occurrence of ®, 
being considered is an occurrence in ®’ or ®”, say it is an occurrence in ®’. Let ®”* be the 
result of replacing ®, in ®’ by ®g. By the induction assumption ®” is logically equivalent to 
®’. Let ®* be the result of replacing the occurrence of ®; in ® by ®g. Then &* is &®* & &”, 
and this is easily argued to be logically equivalent to ®’ & ©”, i.e., to ®. Likewise one handles 
the four cases involving V,7,—,< . If ® is Vr®'(z,7) then let &”*(z,7) be the result of 
replacing the occurrence of ®; in ®'(x,y) by ®g. Then by the induction hypothesis 6” (x, 7) 
is logically equivalent to ®'(x, ¥), so given a structure A of type £ we have 


AF O%(2,7) 3 O(a, 9); 


hence 
Af ®"(a,7) @ (a, #) 


for a € A, so 
AE Vx"(2,7) iff AK Var®'(2, 7); 


thus ® is logically equivalent to Vz" (x,y). Similarly, we can handle the case Ar’ (z, 7). 


Definition 1.16. An open formula is a formula in which there are no occurrences of quan- 
tifiers. 


Definition 1.17. A formula ® is in prenex form if it looks like 
Oise Oca s) 


where each Q; is a quantifier and ®/(x,,...,2,,) is an open formula. ©’ is called the matrix 
of ®. 


Here, and in all future references to prenex form, we have the convention that no quan- 
tifiers need appear in the formula ®. 


Theorem 1.18. Every formula is logically equivalent to a formula in prenex form. 
PROOF. This follows from 1.12, 1.13, and 1.15. First, if necessary, change some of the bound 


variables to new variables so that for any variable x there is at most one occurrence of Vx as 
well as dz in the formula, both do not occur in the formula, and no variable occurs both as 
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a bound variable and a free variable. Then one simply pulls the quantifiers out front using 
1.13. 


EXAMPLE. The following shows how to put the formula Vz = (r(x, y) > der(z, z)) in prenex 
form. 


Va (r(x,y) — dar(a2,z)) ~ Ve 7(r(a, y) — Sur(w, z)) 


~ Ve 7 sw(r(2, y) - r(w, z)) 


~ VaVw a(r, (x,y) > r(w, z)). 


In view of the associative law for & and V, we will make it a practice of dropping parenthe- 
ses in formulas when the ambiguity is only “up to logical equivalence”. Thus ®, & ®, & ®3 
replaces (©; & ®2) & 3 and ®, & (®2 & 3), etc. Also, we find it convenient to replace 
®, & --- & ®, by Mien ®; (called the conjunction of the ®;), and ®;V---V®,, by V,<,<,, Bi 
(called the disjunction of the ;). a 


Definition 1.19. An open formula is in disjunctive form if it is in the form 
\V/ & 4; 
oe 


where each ®;; is atomic or negated atomic (i.e., the negation of an atomic formula). An 
open formula is in conjunctive form if it is in the form 


GV bs 
j 
where again each ®;; is atomic or negated atomic. 


Theorem 1.20. Every open formula is logically equivalent to an open formula in disjunctive 
form, as well as to one in conjunctive form. 


ProoF. This is easily proved by induction on the length of the formula by using the 
generalized distributive laws 


(V ») & (Vv v) ey VV & U;), 
(2,) v (du;) ~ Kha, v9), 
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the generalized De Morgan laws 


and the elimination of —,<, and —-. 


EXAMPLE. Let ® be the formula (with £ = {-,<}) 


(m-y 2) > I[(@ < z) Vw = 0)]. 


Then 
®w~ A(z-y Xz) Val(xz < z) V(x & 0)] 
wa(e-y & 2) V [Ala < z) & A(x & 0)] (in disjunctive form) 
~ [Atay & z)V A(x < z)] & [A(a-y & z) VA(a & O)] (in conjunctive form). 


The notions of subalgebra, isomorphism, and embedding can be easily generalized to 
first-order structures. 


Definition 1.21. Let A and B be first-order structures of type £. We say A is a substructure 
of B, written A < B, if A C B and the fundamental operations and relations of A are 
precisely the restrictions of the corresponding fundamental operations and relations of B to 
A. If X C B let Sg(X) be the smallest subset of B which is closed under the fundamental 
operations of B. The substructure Sg(X) with universe Sg(X) (assuming Sg(X) 4 @) is 
called the substructure generated by X. As in II§3 we have |Sg(X)| < |X|+ |F|+w. If K is 
a class of structures of type &, let S(K’) be the class of all substructures of members of K. 


A very restrictive notion of substructure which we will encounter again in the next section 
is the following. 


Definition 1.22. Let A,B be two first-order structures of type £. A is an elementary 
substructure of B if A < B and for any sentence ® of type £4 (and hence of type Lz), 


AF® if BE®. 


In this case we write 


A ~<B. 


S‘)(K) denotes the class of elementary substructures of members of K. 
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EXAMPLE. Let us find the elementary substructures of the group of integers Z = (Z,+,—,0). 
Suppose A ~ Z. As Z is a group, it follows that A is a group. 


Z & Ariy(r # y), 


sO 


A - Ardy(x ¥ y); 


hence A is nontrivial. Thus for some n > 0,n € A. As 


Ze aAg(e+e+:--+22Nn), 
where there are n x’s added together, it follows that A satisfies the same; hence 
1EA. 


But then 
A=Z. 


Definition 1.23. Let A and B be first-order structures of type £& and suppose a: A — B 
is a bijection such that 
Ly (Gigs: On) =f (ayy sua) 

for f a fundamental operation, and that r(a),...,@,) holds in A iff r(aa,...,@a,) holds in 
B. Then a is an isomorphism from A to B, and A is isomorphic to B (written A = B). If 
a: A-— Bis an isomorphism from A to a substructure of B, we say a is an embedding of 
A into B. Let (4) denote the closure of K under isomorphism. An embedding a: A > B 
such that aA ~< B is called an elementary embedding. 


EXERCISES §1 


1. In the language of semigroups {-}, find formulas expressing (a) “x is of order dividing 
n,” where n is a positive integer, (b) “x is of order at most n,” (c) “x is of order at 
least n.” 


2. Find formulas which express the following properties of structures: (a) A “has size at 
most n,” (b) A “has size at least n.” 


3. Given a finite structure A for a finite language show that there is a first-order formula 
® such that for any structure B of the same type, BE ® iff B= A. 


Given a graph (G,r) and g € G, the valence or degree of g is |{h € G: hrg}|. 


4. In the language of graphs {r}, find formulas to express (a) “x has valence at most n,” 
(b) “a has valence at least n,” (c) “2 and y are connected by a path of length at most 


” 


n. 
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5. Show that the following properties of groups can be expressed by first-order formulas: 
(a) G “is centerless,” (b) G “is a group of exponent n,” (c) G “is nilpotent of class 
k,” (d) “a and y are conjugate elements.” 


A property P of first-order structures is first-order (or elementary) relative to kK, where 
K is a class of first-order structures, if there is a set © of first-order formulas such that for 
A€éK, A has P iff A — &. If we can choose © to be finite, we say that P is strictly first- 
order (or strictly elementary). Similarly, one can consider properties of elements of first-order 


structures relative to K. 


6. Show that “being of infinite size” is a first-order property (relative to any Kk). 


7. Relative to the class of graphs show that the following properties are first-order: (a) 
“x has infinite valence,” (b) “x and y are not connected.” 


8. Prove that if A = B, then AF © iff B 


= ®, for any ®. 


9. Let K = {N} where N is the natural numbers (N,+,-, 1). Show that relative to K the 
following can be expressed by first-order formulas: (a) “x < y,” (b) “rly,” (c) “x isa 


prime number.” 


10. Put the following formula in prenex form with the matrix in conjunctive form: 


Valery > dy(ary — Ax(yre & xry))}. 


11. Does the following binary structure (Figure 31) satisfy 


Valsy(ary @ dx(ary))]? 


Figure 31 


12. Express the following in the language {r}, where r is a binary relation symbol: 


(a) (A,r) “as a partially ordered set,” 
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(b) (A,r) “is a linearly ordered set,” 
c) (A,r) “is a dense linearly ordered set,” 


( 
(d 


r “is an equivalence relation on A,” 


e) r “is a function on A,” 


(f) r “is a surjective function on A,” 


) 
) 
) 
) 
) 
) 


(g) r “is an injective function on A.” 


A sentence ® is universal if ® is in prenex form and looks like 


where W is open, i.e., ® contains no existential quantifier. 


13. 


14. 


15. 


16. 


Li 


18. 


19. 


Show that substructures preserve universal sentences, i.e., if A < B and ® is a universal 
sentence, then 


BEO®SAES®. 


Show that in the language {-}, the property of being a reduct (see II§1 Exercise 1) of 
a group is first-order, but not definable by universal sentences. 


Show that any two countable dense linearly ordered sets without endpoints are isomor- 
phic. [Hint: Build the isomorphism step-by-step by selecting the elements alternately 
from the first and second sets.| 


Can one embed: 


(a) (w,<,+,0) in (w,<,-,1)? 
(b) (w,<,-,1) in (jw, <,+,0)? 


Let A be a finite structure. Describe all possible elementary substructures of A. 


Let A be a countable dense linearly ordered set without endpoints. If B is a substruc- 
ture of A which is also dense in A, show B ~ A. 


Find all elementary substructures of the graph (called a rooted dyadic tree) pictured 
in Figure 32. 
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Figure 32 


If we are given two structures A and B of type £, then a mapping a: A > Bisa 
homomorphism if (i) af(a1,...,@n) = f(aa,...,@a,) for f € F, and (ii) r(ai,...,@n) > 
r(aay,...,Q@dn) forr ER. 

If a is a homomorphism we write, as before, a: A — B. The image of A under a, 
denoted by aA, is the substructure of B with universe aA. The homomorphism a is an 
embedding if the map a: A — aA is an isomorphism. A sentence ©® is positive if it is in 
prenex form and the matrix uses only the propositional connectives & and V. 


20. Suppose a : A — B is a homomorphism and ® is a positive sentence with A — ®. 
Show aA — ©; hence homomorphisms preserve positive sentences. 


21. Let £ = {f} where f is a unary function symbol. Is the sentence VrVy(fa = fy — 
x & y) logically equivalent to a positive sentence? 


22. Is (a) the class of 4-colorable graphs, (b) the class of cubic graphs, definable by positive 
sentences in the language {r}? 


23. Show every poset (P,<) can be embedded in a distributive lattice (D, <). 


A family @ of structures is a chain if for each A,B € €C either A< Bor B< A. If C is 
a chain of structures, define the structure UC by letting its universe be U{A: A € C}, and 
defining f(a,,...,a,) to agree with fA(a,,...,a,) for any A € @ with a,,...,a, € A, and 
letting r(a1,...,@n) hold iff it holds for some A € €. 

A sentence ® is an Va-sentence iff it is in prenex form and it looks like Vz, ...Vt%mayi... dyn, 
where W is open. 


24. If C is a chain of structures and ©® is an Va-sentence such that A — ® for A € C, show 
that UC £ ©. 


25. Show that the class of algebraically closed fields is definable by Va-sentences in the 
language {+,-,—, 0, 1}. 


232 V Connections with Model Theory 


26. The class of semigroups which are reducts of monoids can be axiomatized by 


VavyVz[(z-y)- 2 x-(y-z)] 


wWy(ycerc-yk&y- rey). 


Can this class be axiomatized by Va-sentences? 


Given a nonempty indexed family (A,)j;<; of structures of type £, define the direct product 
I]<_7 Ai to be the structure A whose universe is the set [[,-, A;, and where fundamental 
operations and relations are specified by 


FA(ar,...,4n)(t) = f* (ar(Z),.--, @n(@)) 
r“(a1,...,@n) holds iff for all i € J, r“*(a;,(i),...,@n(é)) holds. 


tel. 


27. Given homomorphisms a; : A > B,, i € J, show that the natural mapa: A = [J,<; Bi 
is a homomorphism from A to [],-; Bi. 


28. Show that a projection map on [],_,; A; is a surjective homomorphism. 


i€l 
A Horn formula ® is a formula in prenex form which looks like 
OTieccOut a (&0,) 
where each Q; is a quantifier, and each ®; is a formula of the form 
WiV---V Uy, 
in which each WV; is atomic or negated atomic, and at most one of the WV, is atomic. 


29. Show that the following can be expressed by Horn formulas: (a) “the cancellation law” 
(for semigroups), (b) “of size at least n,” (c) any atomic formula, (d) “inverses exist” 
(for monoids), (e) “being centerless” (for groups). 


30. If ® is a Horn formula and A; — ® for z € J, show that 


][AF«¢ 


we. 


A substructure A of a direct product [],<; Ai is a subdirect product if 7;(A) = A; for all 
i € I. An embedding a: A = [J,-; Aj is a subdirect embedding if aA is a subdirect product. 
A sentence ® is a special Horn sentence if it is of the form 


where each ®; is positive and each WV; is atomic. 
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31. Show that a special Horn sentence is logically equivalent to a Horn sentence. 
32. Show that if A is a subdirect product of A;, 7 € J, and ® is a special Horn sentence 
such that A; — ® for alli € J, then A — 9; hence subdirect products preserve special 


Horn sentences. 


33. Can the class of cubic graphs be defined by special Horn sentences? 


A complete graph (G,r) is one satisfying 
VaVy(a # y > ary). 


A complete graph with one edge removed is almost complete. 


34. Show that every graph is subdirectly embedded in a product of complete and/or almost 
complete graphs. 


35. If A is an algebra of type F with a discriminator term t(z, y, z) [and switching term 
s(x, y, u, v)| show that A satisfies (see IV§9) 


(preqg&prg otp,¢p) = tap,” 
(p= qV Pq) — 8(p,4,P,4) © P 
(py qVP¥#Q) > S(P,¢),9) * 4 


and if A is nontrivial, 
(p # q)  Valt(p,q, 2) Pp]. 


Show that, consequently, if A is nontrivial, then for every [universal] F-formula ¢ there 
is an [universal] F-formula ¢* whose matrix is an equation p ~ q such that A satisfies 


oor. 


Define the spectrum of an £-formula ¢, Spec ¢, to be {|A|: A is an £-structure, A ¢, A 
is finite}. 


36. (McKenzie). If ¢ is an F-formula satisfied by some A, where F is a type of algebras, 
show that there is a (finitely axiomatizable) variety V such that Spec V (see IV89 
Exercise 4) is the closure of Spec @ under finite products. 
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§2. Reduced Products and Ultraproducts 


Reduced products result from a certain combination of the direct product and quotient con- 
structions. They were introduced in the 1950’s by Los, and the special case of ultraproducts 
has been a subject worthy of at least one book. In the following you will need to recall the 
definition of [a = 6] from IV§5.5, and that of direct products of structures from p. 232. 


Definition 2.1. Let (A,);c; be a nonempty indexed family of structures of type £, and 
suppose F' is a filter over J. Define the binary relation 07 on [],-, A; by 


i€l 
(a,b) € Op iff [a=b] € F. 
(When discussing reduced products we will always assume @ ¢ F’, i.e., F' is proper.) 


Lemma 2.2. For (A;)ic; and F as above, the relation 0 is an equivalence relation on 
[ep Ai. For a fundamental n-ary operation f of |],<, Ai and for 


(a1, 1), abet (dn; bn) — Or 


we have 
(F(a1,-+-,4n), f(b1,---,0n)) € Op, 


1.€., Op is a congruence for the “algebra part of A”. 
PROOF. Clearly 0p is reflexive and symmetric. If 


(a, b), (b,c) € Or 


then 

[a = 9], [b=cl EF, 
hence 

[e=b]N[b=ceF 
Now from 


it follows that 


SO 
(a,c) € Op. 


Consequently, #7 is an equivalence relation. Next with f and (aj, b;) as in the statement of 
the lemma, note that 


[f(a1,--.,@n) = f(b1,-.-,bn)] 2 [ar = bi] N--- 9 [an = da]; 
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hence 


sO 


(f (a1, Paar cia f (bi, ay bn)) E Op. 


Definition 2.3. Given a nonempty indexed family of structures (A,)jc; of type & and a 
proper filter F over J, define the reduced product |[,-; Ai/F as follows. Let its universe 
[ie Ai/F be the set [],-, Ai/Or, and let a/F denote the element a/@p. For f an n-ary 
function symbol and for a1,...,an € [<7 Ai, let 


Jai) Paes qa] PYF (Gee Oa iP: 
and for r an n-ary relation symbol, let r(a,/F,...,@,/F') hold iff 


{iel: A; Er(ai(t),...,an(s))} EF. 


If K is a nonempty class of structures of type £, let Pr(K) denote the class of all reduced 
products |[,-, A;/F, where A; € K. 


il 
In view of Definition 2.3, it is reasonable to extend our use of the |] notation as follows. 


Definition 2.4. If (A,);<-; is a nonempty indexed family of structures of type £ and if 
P(a1,...,@n) is a sentence of type £4, where A = [],-; Ai, let 


[®(a1,...,an)] = {i € 1: Ai & ®(a1(2),..., an(z))}. 


Thus given a reduced product [],-; A;/F and an atomic sentence ®(a,,...,a,), we see 


that 


tel 


[A/F E G@/F,...,4n/F) iff [O(cu,...,a,)] €F. 
i€l 
Determining precisely which sentences are preserved by reduced products has been one 
of the milestones in the history of model theory. Our next theorem is concerned with the 
easy half of this study. 


Definition 2.5. A Horn formula is a formula in prenex form with a matrix consisting of 
conjunctions of formulas ©; V --- V ®, where each ®; is atomic or negated atomic, and at 
most one ®; is atomic in each such disjunction. Such disjunctions of atomic and negated 
atomic formulas are called basic Horn formulas. 


The following property of direct products is useful in induction proofs on reduced prod- 
ucts. 
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Lemma 2.6 (The maximal property). Let A;, i € I, be a nonempty indexed family of 
structures of type &. If we are given a formula 4x®(x,y1,.--,Yn) of type & and ay,..., an € 
[ier Ai, then there is ana € [],-, Ai such that 


we. 


[AGO a Gye vasiig)|) = [Olapayaies Ge): 


PROOF. For 


i € [Ar®(2,a1,...,an)] 
choose a(z) € A; such that 
A; — ®(a(7), a,(),...,an(4)), 


and for other 7’s in J, let a(z) be arbitrary. Then it is readily verified that such an a satisfies 
the lemma. 


Theorem 2.7. Let [[,<; Ai/F be a reduced product of structures of type &, and suppose 
®(x1,...,%n) is a Horn formula of type &. If 


G1, 6.650 € |] Ai 


wel 


and 
[®(a1, ares , On) ef 


then 
[[Ad/F E S(ai/F,..., 4n/F). 


wel. 


PROOF. First let us suppose ® is a basic Horn formula 
DiGig resin) Vt WOE 0 Wa): 


Our assumption 


is equivalent to 


If, for some ®; which is a negated atomic formula we have 


[- [®;(a1,...,@n)] ¢ F, 
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then, by the definition of reduced product, 


[[Ai/F FE Oila/F,...,¢n/F); 


iI 
hence 


[ [A/F E O(a/F....,4n/F). 


wel 


If now for each negated atomic formula ®; we have 
I- [®;(a1,. as sstlech| € F, 


then there must be one of the ®;’s, say ®,, which is atomic. (Otherwise 


I—[®(a,...,4n)] =I- |) [Gila,...,0,)] € F, 


1<i<k 
which is impossible as F' is closed under intersection and @ ¢ F.) Now in this case 
[- ®; (a1, cee An) CF 


for1<i<k-—1,s0 


Since 


taking the intersection we have 


[( & 74,(a1,--++4)) k x(a...) CF, 
1<i<k-1 


sO 
[®,(a1,..-,@n)] € F. 
This says 
[A/F KE Sk (ai/F,...,4n/F); 
iel 
hence 


[[Ai/F EF (ai/F,...,4n/F). 


ie] 
If ® is a conjunction 
Wi & --- OV 
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of basic Horn formulas, then 
[Vi(ai,..., Qn) & eee & W,(a1,.--,4n)] eF 
leads to 


for 1iso4 =k 80 
[[Aid/F KE Vila/F,...,¢n/F), 


tel 


1<i<hk, and thus 
[[4/F I O(0j/ F221, Oni F): 


wel. 


Next we look at the general case in which © is in the form 


Oi eNO pia Ui Gis eee) 


with WV being a conjunction of basic Horn formulas. We use induction on the number of 
occurrences of quantifiers in ®. If there are no quantifiers, then we have finished this case in 
the last paragraph. So suppose that the theorem is true for any Horn formula with fewer than 
m occurrences of quantifiers. In ® above let us first suppose Q, is the universal quantifier, 
ome 

P= VypO" (ys Biya): 


If we are given a € [J,-; Ai, then from 


it follows that 
as 


By the induction hypothesis 
[[ AVF E O(@/F.a/F,...,¢n/F); 
i€l 


hence 
[ [A/F KE O(ai/F,...,4n/F). 


wel. 


Next suppose Q, is the existential quantifier, i.e., 


® = 


yi ®* (yi, 1,---,En)- 
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Choose by 2.6 a € [[,<; A; such that 
[Plat ..+5@n)] = [0" (a, ais. -4 Gan) I. 
Then again by the induction hypothesis 
[[Aid/F EK @ (G/F a/F,...,0n/F); 
i€l 


hence 


[[Ai/F E S(ai/F,...,4n/F). 


we. 


The following generalizes the definition of ultraproducts in IV86 to arbitrary first-order 
structures. 


Definition 2.8. A reduced product [],-; Ai/U is called an ultraproduct if U is an ultrafilter 
over I. If all the A; = A, then we write A//U and call it an ultrapower of A. The class of 
all ultraproducts of members of K is denoted Py(K). 


For the following recall the basic properties of ultrafilters from IV83. We abbreviate 
Giig.'s45. Oy DY G;,ANd 047 Uy o55 GU bya fu, 


Theorem 2.9 (Los). Given structures Aj, i € I, of type £, if U is an ultrafilter over I and 
® is any first-order formula of type £, then 


[[AsV/U EF O(a/U,...,an/U) 


tel 


PRrooF. (By induction on /(®).) For /(®) = 0 we have already observed that the theorem is 
true. So suppose /(®) > 0 and the theorem holds for all UV such that I(W) < 1(®). If 


=, & O, 
then 
[2(@) & Ba] EU iff [8,(A] N[S.(B] €U 
iff [®(a@)]e@U fori =1,2 
iff [[A/U- 0(4/U) fori =1,2 


iff |] Ai/U - ®(G/U) & ,(4/U). 
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One handles the logical connectives V,7, >, <> in a similar fashion. If 


®(d) = 4r®(z, a), 


choose a € [],-, Ai such that 


[3x6(x, @)] = [O(a,@)] 
Then 
[ea] eU iff [ArO(x,a)] €U 
iff [O(a,@)]¢U for some a 
iff [[4/v - b(a/U,a/U) for some a 
ier 
iff |] Ai/U - Ar0(a,a/U) 
ier 
iff [[Ai/U - ®@/U). 
i€l 
Finally, if 


®(@) = Vx®(cx, @) 


then one can find a (a) such that the quantifier V does not appear in V and ® ~ W (by 
1.13), hence from what we have just proved, 


[@]cU iff [V@)cU 


iff [[Ai/U - W(a/v) 
iff |] Ai/U - ®@/U) 


Lemma 2.10. Let A be a first-order structure, I a nonempty index set and F a proper filter 
over I. Fora € A, let cq, denote the element of A!’ with 


Cid) = "0s wel. 
The map 
a:A7A!l/F 
defined by 
OG = Call 


is an embedding of A into A!/F. The map a is called the natural embedding of A into A! /F. 
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PROOF. (Exercise.) 


Theorem 2.11. Jf A is a first-order structure of type &, I is an index set, and U is an 
ultrafilter over I, then the natural embedding a of A into A!/U is an elementary embedding. 


ProoF. Just note that for formulas ®(271,...,2,,) of type £, we have 


[Des i423 5 Ca, Wed AE ALS O(ag fcc 06) 


and 
[Plies Ce eH a at AE Olas cso, ): 


Thus 
aA ®(aa1,...,aan) iff A’/U EK ®(aa,..., aan). 


Next we prove one of the most celebrated theorems of logic. 


Theorem 2.12 (The Compactness Theorem). Let © be a set of first-order sentences of type 
£ such that for every finite subset Ug of X there is a structure satisfying No. Then A E ¥ 
for some A of type &. 


PROOF. Let I be the family of finite subsets of /, and for 7 € J let A; be a structure 
satisfying the sentences in 7. For 2 € I let 


Then 
Tall = al iiies 


so the collection of J;’s is closed under finite intersection. As no J; = @ it follows that 
F={JCI: JCJ for somei e€ I} 


is a proper filter over J, so by [V83.17 we can extend it to an ultrafilter U over J; and each 
J; belongs to U. Now for ® € © we have 


{®} € J, 


sO 
A;F® forj € Ja} 


as ® € j. Looking at [],-; Ai we see that 


[®] 2 Jre} 
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sO 


[[A/U @: 


ic 
hence 


[[4/UF =. 


tel 


Corollary 2.13. Jf is a set of sentences of type & and ® is a sentence of type & such that 
EK, 


then, for some finite subset Xo of d, 
Mo & ®. 


PrRooF. If the above fails, then for some © and © and for every finite subset “io of & there is 
a structure A which satisfies No but not ®; hence XpU{- 9} is satisfied by some A. But then 
2.12 says & U {= ®} is satisfied by some A, which is impossible as A / © implies A — ©. 


A slight variation of the proof of the compactness theorem gives us the following. 


Theorem 2.14. Every first-order structure A can be embedded in an ultraproduct of its 
finitely generated substructures. 


Proor. Let J be the family of nonempty finite subsets of A, and for 7 € J let A; be the 
substructure of A generated by 7. Also, for i € I let 


As in 2.12 extend the family of J,’s to an ultrafilter U over I. For a € A let Aa be any 
element of [[,_; A; such that 
(Aa)(7) =a 
if a € z. Then let 
a:A>][A/U 


iel 
be defined by 
aa = (Aa)/U. 
For ®(21,...,2,) an atomic or negated atomic formula and aj,...,a@, € A such that 


A FE ®(a1,...,@n), 
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we have 


[®(Aaz, ace Aan) | > Bien J an}3 


hence 
a(A) & ®(Aa,/U,..., Aan /U). 


This is easily seen to guarantee that @ is an embedding. 


For the remainder of this section we will assume that we are working with some convenient 
fixed countably infinite set of variables X, i.e., all formulas will be over this X. 


Definition 2.15. A class K of first-order £-structures is an elementary class (or a first-order 
class) if there is a set © of first-order formulas such that 


Ack iff AF. 


K is said to be aziomatized (or defined) by % in this case, and © is a set of axioms for K. 
Let Th(k) be the set of first-order sentences of type £ satisfied by K, called the theory of 
K. 


Theorem 2.16. Let K be a class of first-order structures of type &. Then the following are 
equivalent: 


(a) K is an elementary class. 
(b) K is closed under I,.S), and Py. 
(c) 1S) Py(K*), for some class K*. 


ProoF. For (a) > (b) use the fact that each of I, S‘~) and Py preserve first-order properties. 
(b) = (c) is trivial, for let A* = K. For (c) => (a) we claim that K is axiomatizable by 
Th(A*) where A* is as in (c). Note that K / Th(A*). Suppose 


AE Th(K*). 


Let Th*(A) be the set of sentences ®(a1,...,d@,) of type £4 satisfied by A, and let J be the 
collection of finite subsets of Th*(A). If 


®(a1,...,4n) € Th*(A) 


then for some B € k*, 


Baas ep Ol tiee ne) 


For otherwise 
BY SAGs Vigo O Gis ess 


which is impossible as 


Ae Ol G i 5 Bu) 
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and A — Th(K*). Consequently, for i € I we can choose A; € AK* and elements G(i) € A; 
for a € A such that the formulas in 7 become true of A; when a is interpreted as a(i), for 
a € A. Let 

keal{7j elit Ga} 
and, as before, let U be an ultrafilter over J such that J; € U for 2 € I. Let @ be the element 
in [[,<, Ai whose ith coordinate is a@(z). Then for 


®(a1,...,4n) € Th*(A) 


we have 
[®(a1,...,4,)] Dg EU 
where 
4 = {®(ay,. , An) }; 
hence 
[®(a1,...,an)] € U. 
Thus 


[[AsV/U F OG/U,..., 4/0). 


wel. 


By considering the atomic and negated atomic sentences in Th*(A), we see that the mapping 


a:A- [[4/v 
iel 
defined by 
aa = a/U 


gives an embedding of A into [[,., Ai/U, and then again from the above it follows that the 
embedding is elementary. Thus A € IS) Py(K*). 


Definition 2.17. An elementary class K is a strictly first-order (or strictly elementary) 
class if K can be axiomatized by finitely many formulas, or equivalently by a single formula. 


Corollary 2.18. An elementary class K of first-order structures is a strictly elementary 
class iff the complement K' of K is closed under ultraproducts. 


PROOF. If K is axiomatized by ®, then the complement of K is axiomatized by = ©; hence 
kK’ is an elementary class, so K’ is closed under Py. Conversely suppose K’ is closed under 
Py. Let I be the collection of finite subsets of Th(A’). If K is not finitely axiomatizable, for 
each 7 € I there must be a structure A; such that 
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but 
A, ¢ K. 
Let 


and construct U as before. Then 


[[A/UF @ 


tel 


for 6 € Th(k) as 
[®] =) Jo} EU. 


Thus 
[[A/U E THK), 
ier 
sO 
[[Av/u € x. 
EL 


But this is impossible since by the assumption 


[[A/vUex' 


wel 


as each A; € kK’. Thus K must be a strictly elementary class. 


Definition 2.19. A first-order formula ® is a universal formula if it is in prenex form and all 
the quantifiers are universal. An elementary class is a universal class if it can be axiomatized 
by universal formulas. 


Theorem 2.20. Let K be a class of structures of type &. Then the following are equivalent: 


(a) K is a universal class, 
(b) K is closed under I, S, and Py, 
(c) K =ISPy(k*), for some k*. 


PROOF. (a) => (b) is easily checked and (b) = (c) is straightforward. For (c) = (a) let 
Thy(A“*) be the set of universal sentences of type £ which are satisfied by A*, and suppose 
A — Thy(k*). Let Th§(A) be the set of universal sentences of type £4 which are satisfied 
by A. Now we just repeat the last part of the proof of 2.16, replacing Th* by Th. 


Definition 2.21. A first-order formula ® is a universal Horn formula if it is both a universal 
and a Horn formula. A class K of structures is a universal Horn class if it can be axiomatized 
by universal Horn formulas. 


Before looking at classes defined by universal Horn formulas we need a technical lemma. 
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Lemma 2.22. The following inequalities on class operators hold: 


(a) P < IPp, 
(b) PrPr < IPpr, 
(c) Pee ISP: 


PROOF. (a) Given [[,-; Ai let F = {J} be the smallest filter over J. Then one sees that 


[[ 4:2 [[4/F 


wel wel 


using the map a(a) = a/F. 
(b) Given a set J and a family of pairwise disjoint sets [;,j7 € J, and algebras A; for 
7 € JI, and a filter F over J and for j € J a filter F'; over I;, define 


FS (5 


jeJ 


and let 
F={SCI:{je 3:80 €F} € Fh. 


Then F is easily seen to be a filter over I, and we will show that 
IL | [[ 4/4 /r= Tae 
jes \iel; iel 


For each j € J define 


by 


Then a; is a surjective homomorphism from ]];<; Ai to []j<7, Ai. Let 
Vj: [[ 4: = [4/4 
i€l; i€l; 
be the natural mapping. Define 


6: [4] | []4/e 


iel jeJ \iel; 
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to be the natural mapping derived from the v;’s, i.e., 


Let 


v:|[ i [ [4/6 |) [| [4/4 /s 


jet \iel; jet \iel; 
be the natural map (see Figure 33.) The mapping v o @ is surjective as each of v and (3 is 
surjective. Also let 
iel iel 


be the natural map. Let us show that 
ker(v 0 3) = Op. 
We have 


(a,b) € ker(v o 8) & (Ga, Bb) € kerv = Op 
& [Ga = Bb] € F 
@ {7 €J:0;(al,) =vj(bly)} € F 
&{jeJ:fa=)nLer}er 
© [a =)] € F. 


Thus we have a bijection 
iel jes \iel; 


such that yov* = vo (3. If we were working in a language of algebras, we could use the first 
isomorphism theorem to show ¥y is an isomorphism. We will leave the details of showing that 
y preserves fundamental relations to the reader. 
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TTA, TT (TT A,/F 
fey ti B jedi fi) 
O: 
Vv 
Y 
I. as ae 
; TTA,/F TT (1TA,/F)/F 
ie] jeJ iel J 
Vj 
g A/F 


Figure 33 


(c) If F is a filter over J, let J be the set of ultrafilters over J containing F. Given 
A;,1 € I, define, for U € J, 


ay : [[4:/F - [[4/¥ 


ier wel 
by 
ay(a/F) =a/U, 


and then let 


ier UeJ \iel 


Q: [[4/F = I] (ave) 
be the natural map. We claim that since one clearly has 
PSs 
we must have an injective map a. For if 
a/F #b/F 


then 
[a=] ¢F 


so we can find an ultrafilter U extending F' with 


[a =}] ZU. 
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Thus 
ay (a) 4 ay(d) 


so @ is injective. If we were working with algebras, we would clearly have an embedding, 
and we again leave the details concerning fundamental relations to the reader. 


Theorem 2.23. Let K be a class of structures of type &. Then the following are equivalent: 


(a) Kk is a universal Horn class, 

(b) K is closed under I, S, and Pp, 
(c) K is closed under I, S,P, and Py, 
(d) K = ISPp(k*), top some k*, 

(e) K =ISPPy(K*), for some k*. 


PROOF. (a) = (b) is easily checked using 2.7, and (b) = (c), (b) = (d) and (c) => (e) are 
clear. For (d) = (a) and (e) => (a) let Thyy(K™*) be the set of universal Horn sentences of 
type £ which are true of K*. Certainly K —/ Thy,(K*). Suppose 


AK Thyy(K°). 


Let Th5(A) be the set of atomic or negated atomic sentences true of A in £4. (This is called 
the open diagram of A.) If we are given 


{®(a1,...,@n),-.-, Px (ai,...,@n)} C Th§(A) 


then 


ASG, el Oily a A) Oe es CD Be) | 


We want to show some member of P(K“) satisfies this sentence as well. For this purpose it 
suffices to show 


P(K*) FVa1...Vtn[> ®1 (41, ...,0n) V+ V Ap (21,.-.,2n)]- 


If at most one ®; is negated atomic, then the universal sentence above would be logically 
equivalent to a universal Horn sentence which is not true of A, hence not of K*. So let us 
suppose at least two of the ®; are negated atomic, say ®; is negated atomic for 1 <i <t 
(where 2 <t<k), and atomic fort+1<i<k. Then, for 1 <i < t, one can argue as above 
that 


POP Va is Vina OA Bight) VO ais ak a) Vee V Opa es 


hence for some A; € K*, 


Ay © agi 2 Se, (Oy, Sn) & Oni (Giger -4 Gn) <? & Oglai ys 4-H): 
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For 1<i<t,1<j <n, choose a,(i) € A; such that 


A; K ®;(a1(i),...,@n(i)) & Beyr(ar(d),..-,dn(i)) & «++ & B(ar(a),...,an(4)). 


and 


Let I be the collection of finite subsets of Thj(A), and proceed as in the proof of 2.16, 
replacing Th*(A) by Th§(A), to obtain 


A €ISPyP(K*). 


From 2.22, 
ISPr =< ITSPPy 1S PePr < TSP: 
hence 
ISPp =ISPPy. 
Now 
ISPyP < ISPpPpr = IS'Pp; 
hence 


A €ISPyP(K*) C ISPp(K*) = ISPPy(K*) = K. 


Let us now turn to algebras. 


Definition 2.24. A quasi-identity is an identity or a formula of the form (p; & q & «++ & pn & 
Qn) ~p&q. A quasivariety is a class of algebras closed under J, S, and Pr, and containing 
the one-element algebras. 


Theorem 2.25. Let K be a class of algebras. Then the following are equivalent: 


(a) K can be axiomatized by quasi-identities, 

(b) K is a quasivariety, 

(c) K is closed under I,S,P, and Py and contains a trivial algebra, 
(d) K is closed under IS Pp and contains a trivial algebra, and 

(e) K is closed under 1S PPy and contains a trivial algebra. 


PROOF. As quasi-identities are logically equivalent to universal Horn formulas, and as trivial 
algebras satisfy any quasi-identity, we have (a) => (b). (b) = (c), (b) = (d) and (c) => (e) 
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are obvious. If (d) or (e) holds, then A can be axiomatized by universal Horn formulas by 
2.23 which we may assume to be of the form Vz,...Vz,(W, V--- V U;,) with each WV; an 
atomic or negated atomic formula. (Why?) As a trivial algebra cannot satisfy a negated 
atomic formula, exactly one of V,,..., WV, is atomic. Such an axiom is logically equivalent 
to a quasi-identity. 


For us the study of universal algebra has been almost synonymous with the study of 
varieties, but the Russian mathematicians under the leadership of Mal’cev have vigorously 
pursued the subject of quasivarieties as well. 


EXAMPLE. The cancellation law 

LY Ru-zroyrRz 
is a quasi-identity. 
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EXERCISES §2 


1. If R is the ordered field of real numbers, show that R“/U is a non-Archimedean ordered 
field if U is a nonprincipal ultrafilter on w. Show that the class of Archimedean ordered 
fields is not an elementary class. 


2. With P the set of prime numbers, show that [],,-p Z/(p) is a ring of characteristic zero. 
Hence show that “being a field of finite characteristic” is not a first-order property. 


3. Show that “being a finite structure of type £” is not a first-order property. 


4. Show that “being isomorphic to the ring of integers” is not a first-order property. [Hint: 
Use IV§6 Exercise 7.] 


5. Prove that the following hold: (a) PyS < ISPy; (b) PrS < ISPp. 
6. Prove that a graph is n-colorable iff each finite subgraph is n-colorable. 


Given two languages £, £' with £ C £L’ and a structure A of type L’, let Af, denote the 
reduct of A to &, i.e., retain only those fundamental operations and relations of A which 
correspond to symbols in £. Then define K [g= {Afc: A € K}. 
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7. Let K be an elementary class of type £’, and let A be a structure of type £, £ CL’. 


Show that A € [S(K 


fe) iff A Thy Tz). 


8. Prove that a group G can be linearly ordered iff each of its finitely generated subgroups 


can be linearly ordered. 


9. If ® is a sentence such that A 


10. If ® is a sentence such that kK — ® > SPpr(Kk) 


E ® = S(A) — ®, then show that © is logically 
equivalent to a universal sentence. 


= ®, then show that ©® is logically 


equivalent to a universal Horn sentence. 


11. Given a language £ let kK be an elementary class and let ® be a sentence such that for 
A,Be kK with B<A,if A © then B - ®. Show that there is a universal sentence 
WV such that K — ® — W. [Hint: Make appropriate changes in the proof of 2.20.] 


12. Given a first-order structure A of type £ let D*(A) be the set of atomic sentences in 
the language £4 true of A. Given a set of sentences © of type £, show that there is a 
homomorphism from A to some B with B — »® iff there is a C with C E D*(A)U®. 


§3. 


Principal Congruence Formulas 


Principal congruence formulas are the obvious first-order formulas for describing principal 
congruences. We give two applications of principal congruence formulas, namely McKenzie’s 
theorem on definable principal congruences, and Taylor’s theorem on the number of subdi- 
rectly irreducible algebras in a variety. Throughout this section we are working with a fixed 
language F of algebras. First we look at how to construct principal congruences using unary 


polynomials. 


Lemma 3.1 (Mal’cev). Let A be an algebra of type ¥ and suppose a,b,c,d € A. Then 


(a,b) € O(c, d) 


iff there are terms 


pila, Y1,--- 
ex € A such that 


1<i<m, and elements e1,..., 


UR) 


a= pi (Sine); 


pilti, 
Pm(t m> 


é 
a) = 


where 


)= pre €.) 


for Lt ni, 


{5i, ti} = {c, d} 
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for 1:7 =m. 


PROOF. Let p;(z, y1,.--,Yx) be any terms of type F and let e;,...,e, be any elements of A. 
Then clearly 


(pilc, €), pi(d, &)) € O(c, d); 
hence if 
{Sick p= 1a} 
and 
pilti, @) = pisa(Sit1, €) 
then by the transitivity of O(c, d), 


(pi($1,€); Pm(tm; €)) € OC, d). 


Thus the collection @* of pairs (a,b) such that there are p;’s and e,’s as above form a subset 
of O(c, d). Now note that 6* is an equivalence relation, and indeed a congruence. For if 


(a;, b;) E 0, 
1<j<n, and if f is a fundamental n-ary operation, let 


a; = pyi(8j15 €;), 


Doal bees C) — Pia Sri e,), 


and 
Diag tims) = 05: 
Then 
f(b, shite as a5, ale SOs) = f(b, areiens bj-1, Pyi(Ss1; é), Qj+1; er seg Qn), 
f(b, teey Cases ir ree Cr) Ajtis+++5 An) = f(b, oe) Oe: nig Dara (Sat €;); Aj+1,--- fa). 


1<i<m,, and 


J (Digs: Feigao sare ej) Ajtiy+++ , On) = f(bi,--- 5 Dj 15 OF Oj aay On); 
hence 
(f (b1, er Opa aj, oe a) f (bi, eed £0; Qj+1, bea ei) — as 
so by transitivity 
(f(a, was An); f(h1, wae 1 On)) el. 


As 
(c,d) € &* C O(c, d) 
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we must have 


O(c,d) =, 


since O(c, d) is the smallest congruence containing (c, d). 


Definition 3.2. A principal congruence formula (of type F) is a formula 
T(x, Y; u, v) 


of the form 


B{a & pr(za,B) & | & pilzi,@) © pisr(zins, B)| & Pa(zh, 8) © y} 


1<i<ni 


where 

{2i, zi} = {u, v}, 
1<i<_n. Let II be the set of principal congruence formulas in ¥(X) where X is an infinite 
set of variables. 


Theorem 3.3. Fora,b,c,d € A and A an algebra of type F, we have 
(a,b) € O(c, d) 

uf 
A F x(a, b,c, d) 


for some x € II. 


PROOF. This is just a restatement of 3.1. 


Definition 3.4. A variety V has definable principal congruences if there is a finite subset 
II) of II such that for A € V and a,b,c,d € A, 


(a,b) € O(c,d) iff AE m(a,b,c,d) 
for some 7 € II. 


Theorem 3.5 (McKenzie). If V is a directly representable variety, then V has definable 
principal congruences. 


PROOF. Choose finite algebras A;,..., A, € V such that for any finite B € V, 
Be TP({Ag,. : ., Ax}), 


and let 
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Now let 
K ={A? x ++ x AS: ji < mi, 1 <i < kh. 


As K is a finite set of finite algebras, it is clear that there is a finite Ip C II such that for 
A€EK anda,b,c,d€ A, 

(a,b) € O(c, d) 
iff 

AF 1(a, b,c, d) 


for some 7 € IIp. Now suppose B is any finite member of P({Aj,...,Ax}) and a,b,c,d€ B 
with 
(a,b) € O(c, d). 


Let 
B= Aj! x---x Aj. 


Let us rewrite the latter as 
Bu X-:: X Bis, X --- X Bu X --: X Bys,, 
with B,;; = Aj. For some 7 € II we have 
BE x(a, b,c, d). 


Let 7(z,y, u,v) be 


Wa. Ag P(E; Ys Us Oy Wy; w- 25 We), 


where ® is open. Let e1,...,e, € B be such that 


B. = O(a; bye,de7) oe). 


As there are at most m* possible 4-tuples 


(a(é, J), B,J), (4, 7), AG, J) 


for 1 < 7 < s; we can partition the indices 71,..., is; into sets Jj,,..., Jz, with t; < m# such 
that on each J;; the elements a, b,c,d are all constant. Thus in view of the description of 
congruence formulas we can assume the e’s are all constant on J;;. The set of elements of B 
which are constant on each J;; form a subuniverse C’ of B, and let C be the corresponding 
subalgebra. Then C € I(x), for if we select one index (7j)* from each J;; then the map 


defined by 
a(c)(i7)" = e((17)") 
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is easily seen to be an isomorphism. As aC € H(B), 


aC E (aa, ab, ac, ad); 


hence 
CE (a,b,c, d). 


It follows that for some 1* € Ip (as C € I(Kx)), 


Cr (a, bye). 


But then 


BE z*(a,b, c,d). 


Hence for any finite member B of V, the principal congruences of B can be described just 
by using the formulas in Ig. 
Finally, if B is any member of V and a,b,c,d € B with 


(a,b) € O(c, d) 


then for some 7 € II we have 
B= a(a,.b;¢.d). 


If 7 is 


UI hoe ye @ (hy, UO ay oe WU, 


with ® open, choose e;,...,e¢, € B such that 


BE ®(a, b,c, d,e1,...,€,). 


Let C be the subalgebra of B generated by {a,b,c,d,e1,...,e,}. Then 


C FE ®(a, b,c, d,e1,-.-,€,) 


so 

C i ma, b, C, d); 
hence for some 1* € Ip, 

CE n*(a,b,c,d), 
so 

BE 7z*(a,b,c,d). 


Thus V has definable principal congruences. 


Before proving Taylor’s Theorem we need a combinatorial lemma, a proof of which can 
be found in [3]. 
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Lemma 3.6 (Erdos). Let « be an infinite cardinal and let A be a set with |A| > 2", Ca 
set with |C| < «. Let A?) be the set of doubletons {c,d} contained in A with c 4 d. If a is a 
map from A) to C, then for some infinite subset B of A, 


a(B) = {e} 
for somee EC. 


Theorem 3.7 (Taylor). Let V be a variety of type F, and let . = max(w,|F|). If V has 
a subdirectly irreducible algebra A with |A| > 2", then V has arbitrarily large subdirectly 
irreducible algebras. 


Proor. If A € V is subdirectly irreducible and |A| > 2", then let a,b € A be such that 
O(a, b) is the smallest congruence not equal to A. As there are only & many formulas in II, 
and as 

A — x(a, b,c, d) 
for some 7 € II, if c 4 d, it follows from 3.3 and 3.6 that for some infinite subset B of A 
there is a 7* € II such that for c,d € B, if c#d then 


A= 7" (a, 6d): 


Given an infinite set J of new nullary function symbols with |J| = m and an infinite set of 
variables X, let J be 


{i# 7:14,7 €J andi ¥¢ j}U (Idy(X)) U {r*(a,6,7,7) 21,7 EJ andi Aj} U{a sb}. 


Then for each finite No C / we see that by interpreting the 7’s as suitable members of B, 
it is possible to find an algebra (essentially A) satisfying Mo. Thus © is satisfied by some 
algebra A* of type FUIU {a, b}. Let J C A* be the elements of A* corresponding to J, and 
let a,b again denote appropriate elements of A*. Then |/| = m, and a ¥ b. Choose @ to be 
a maximal congruence on A* among the congruences on A* which do not identify a and b. 
Then 7,7 € J andi ¥ 7 imply 
(1,9) € 9, 
as 
AY = 9e"(a, Dia) 


Consequently A*/@ is subdirectly irreducible and 


|A*/6| > || =m. 


This shows that V has arbitrarily large subdirectly irreducible members. 


The next result does not depend on principal congruence formulas, but does indeed nicely 
complement the previous theorem. 
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Theorem 3.8 (Quackenbush). Jf V is a locally finite variety with, up to isomorphism, 
only finitely many finite subdirectly irreducible members, then V has no infinite subdirectly 
irreducible members. 


PROOF. Let V* be the class of finite subdirectly irreducible members of V. If A € V then 
let K be the set of finitely generated subalgebras of A. By 2.14 we have 


A €ISPy(k), 
and from local finiteness 
KOC IPs Vv") CASPly"); 


hence 
A €ISPySP(V*), 


sO 
A € ISPPy(V*) 


by 2.23. As an ultraproduct of finitely many finite algebras is isomorphic to one of the 
algebras, we have 
A €ISP(V*); 


hence 
A €IPs(V*), 


so A cannot be both infinite and subdirectly irreducible. 
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EXERCISES 83 
1. Show that commutative rings with identity have definable principal congruences. 
2. Show that abelian groups of exponent n have definable principal congruences. 
3. Show that discriminator varieties have definable principal congruences. 


4. Show that distributive lattices have definable principal congruences. 
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5. Suppose V is a variety such that there is a first-order formula $(z, y, u,v) with 
(a,b) € O(c,d) + AF ¢(a,b, c,d) 
for a,b,c,d € A, A € V. Show that V has definable principal congruences. 


6. Show that a finitely generated semisimple arithmetical variety has definable principal 
congruences. 


7. Are elementary substructures of subdirectly irreducible [simple] algebras also subdi- 
rectly irreducible’ [simple]? What about ultrapowers? 


8. (Baldwin and Berman). If V is a finitely generated variety with the CEP (see II§5 
Exercise 10), show that V has definable principal congruences. 


84. Three Finite Basis Theorems 


One of the older questions of universal algebra was whether or not the identities of a finite 
algebra of finite type F could be derived from finitely many of the identities. Birkhoff proved 
that this was true if a finite bound is placed on the number of variables, but in 1954 Lyndon 
constructed a seven-element algebra with one binary and one nullary operation such that 
the identities were not finitely based. Murskii constructed a three-element algebra whose 
identities are not finitely based in 1965, and Perkins constructed a six-element semigroup 
whose identities are not finitely based in 1969. An example of a finite nonassociative ring 
whose identities are not finitely based was constructed by Polin in 1976. On the positive 
side we know that finite algebras of the following kinds have a finitely based set of identi- 
ties: two-element algebras (Lyndon, 1951), groups (Oates-Powell, 1965), rings (Kruse; Lvov, 
1973), algebras determining a congruence-distributive variety (Baker, 1977), and algebras 
determining a variety with finitely many finite subdirectly irreducibles and definable prin- 
cipal congruences (McKenzie, 1978). We will prove the theorems of Baker, Birkhoff, and 
McKenzie in this section. 


Definition 4.1. Let X be a set of variables and K a class of algebras. We say that Idx (X) 
is finitely based if there is a finite subset © of Idx (X) such that 


and we say that the identities of K are finitely based if there is a finite set of identities © 
such that for any X, 
& E Id«(X). 


!This is apparently an open problem. 
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Theorem 4.2 (Birkhoff). Let A be a finite algebra of finite type F and let X be a finite set 
of variables. Then Ida(X) is finitely based. 


ProoF. Let 6 be the congruence on T(X) defined by 


(p,q) € 8 


iff 
AE pq. 


(This, of course, is the congruence used to define Fy;a)(X).) As A is finite there are only 
finitely many equivalence classes of #. From each equivalence class of 6 choose one term. Let 
this set of representatives be Q = {q@,.--,@n}. Now let © be the set of equations consisting 
of 


oR. y ifz,yEX and (2,y) €8, 


Cc a ife@e X and (x, qi) € 6, 
nACHE ors rey) a Din+1 if f € Fn and (f (di; se Qi edi ch) So. 


Then a proof by induction on the number of function symbols in a term p € T(X) shows 
that if 


(P, qi) € O 
then 
UE pq 
But then 
UE pyq 
if 
AF pq, 
and as 
Ase Ds: 


Idx (X) is indeed finitely based. 


Theorem 4.3 (McKenzie). If V is a locally finite variety of finite type F with finitely many 
finite subdirectly irreducible members and if V has definable principal congruences, then the 
identities of V are finitely based. 


ProoF. Let Ho C II be a finite set of principal congruence formulas which show that V has 
definable principal congruences. Let Hp be {71,...,7n}, and define ® to be 


71 Vee: V Tn. 
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Then for A € V and a,b,c,d € A, 
(a,b) € O(c,d) 6 AE ®(a, b,c, d). 


Let Sj,...,S, be finite subdirectly irreducible members of V such that every finite sub- 
directly irreducible member of V is isomorphic to one of the §;’s. By 3.8 they are, up to 
isomorphism, the only subdirectly irreducible algebras in V. Let WV, be a sentence which 
asserts “the collection of (a,b) such that ®(a,b,c,d) holds is O(c, d),” ie. UV, can be 
valve} ®(u,0,4,0) & Ve®(x,2,u,v) & VaVy[®(2, y, u,v) > Oly, 2, u, v)] 
& VaVyV2[®(z, Y,U, v) & Py, 2,U, v) zs P(x, z,U, v)| 


& & & VarVy1...VinVyn & O(2;, Yi, U,V) HF). £.40)] 


FnZ#OfEFn 
Thus for A any algebra of type ¥, A | W, iff for all a,b,c,d€ A, 
(a,b) € O(c,d) & AE ®(a, b,c, d). 
Next let W2 be a sentence which says 
“an algebra is isomorphic to one of S;,...,5,,” 
(see §1 Exercise 3). Then let V3 be a sentence which says 


“an algebra satisfies V,, and if it is subdirectly irreducible 


then it is isomorphic to one of S;,...,5,.” 


For example, V3 could be 


WU, & ({Svsyle £ y & VuVo(u # v — O(z, y, u, v))] — Vo} V VaVy(a & y)). 


Let 4 be the set of identities of V over an infinite set of variables X. As 


uF Vs, 


there must be a finite subset Up of / such that 


No F V3 


by 2.13. But then the subdirectly irreducible algebras satisfying Ug will satisfy V3; hence 
they will be in V. Thus the variety defined by Ng must be V. 


Now we turn to the proof of Baker’s finite basis theorem. From this paragraph until the 
statement of Corollary 4.18 we will assume that our finite language of algebras is F, and 
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that we are working with a congruence-distributive variety V. Let po(x,y,Z),.--,;DPn(2, y, 2) 
be ternary terms which satisfy J6nsson’s conditions I1§12.6. 


Lemma 4.4. 


V F pilz,u, x) & pi(z,v, 2), Lean 
VExr x yY~ [pi(x, z,y) x pil(z, y, Y) Ves V pp (2; y) x Pala) 


PROOF. These are both immediate from II§12.6. 


The proof of Baker’s theorem is must easier to write out if we can assume that the p;’s 
are function symbols. 


Definition 4.5. Let F* be the language obtained by adjoining new ternary operation 
symbols t),...,tn-1 to F, and let V* be the variety defined by the identities / of type F 
over some infinite set X of variables true of V plus the identities 


ti(x,y, z) © pix, y, Z), 
1l<i<n-l. 
Lemma 4.6. Jf the identities %* of V* are finitely based, then so are the identities % of V. 
PRooF. Let &** be 
“U {ti(x, y, z) © pix, y,z):1<i<n—-IT, 
and let & be a finite basis for U*. Then 
Et Bh 
hence by 2.13 there is a finite subset 5* of &** such that 
Pe are 
Thus X* is a set of axioms for V*; hence there is a finite No C Y such that 
do U {t,(x, y, z) © pi(x,y, zz); 1<i<n-1} 
axiomatizes V*. Hence it is clear that 


No F 


as one can add new functions t; to any A satisfying Up to obtain A* with 


ATE yt, 
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sO 


Definition 4.7. Let 7T* be the set of all terms p(z, 7) of type F* such that (i) no variable 
occurs twice in p, and (ii) the variable x occurs in every nonvariable subterm of p (as defined 
in 11§14.13). 


Lemma 4.8. For A € V* anda,b,a’,b' € A we have 
O(a,b)NO(a' DY AA 
uf 


A & A756 [ti(p(a, 7), (a, @), p(b, Z)) # ta(p(a, Z), (0, B), p(b, Z))] 


for some p(x, 7), q(x,w) € T* and somei,l1<i<n-1. 


PROOF. (=) Suppose c 4 d and 
(c,d) € O(a, b) N O(a’, b’). 
Then we claim that for some p(z,7) € T*, for some j, and for some g from A, we have 
ti(¢,B(a, 3), 4) # t3(c,P(b,5),d). 
To see this first note that the equivalence relation on A generated by 
{(pla, 7), 6(b, #)) : BET", J from A} 
is O(a, b) (one can argue this in a manner similar to the proof of 3.1). As 
(c,d) € O(a, b) 


we see that for each 2, 


(ti(¢, C, d), til, d, d)) 
is in the equivalence relation generated by 
{ (ti(c, pla, 9), d), ti(c, p(b, 7) d)) : p € Lie g from A}. 


As c # d, for some j we know 


tj iG C, d) # ti(c, d, d) 


by 4.4; hence for some j, some g, and the same j, 


t;(c, p(a, J), d) - t,(c, B(b, g),d), 


264 V Connections with Model Theory 


proving the claim. By incorporating c,d into the parameters, we have a p € 7* and param- 
eters € such that 


pla, €) # p(b, €); 


and furthermore 


as 

(p(a, €), p(0, €)) € O(c, d) 
because of 4.4. Now starting with (p(a, €), p(b, &)) instead of (c,d), we can repeat the above 
argument to find gq € T7*,t; and f from A such that 


ti(p(a, ), g(a’, fF), v(b, &)) # tilp(a, &), a(b, fF), p(b, @)), 


as desired. 
(<) If for some i 


ti(p(a, ), g(a’, fF), v(b, &)) # tilp(a, &), a(b, fF), (b, @)), 


then, as the ordered pair consisting of these two distinct elements is in both O(a,b) and 
O(a’, b') by 4.4, we have 
O(a, b) N O(a’, W) ZA. 


Definition 4.9. Suppose the operation symbols in F¥* have arity at most r, with r finite. 
For m < w let T™ be the subset of 7* consisting of those terms p in 7* with no more than 
m occurrences of function symbols. Then define 6,,(2,y,u,v) to be 


\V ZS [ti(p(@, Z), a(u, B), v(y, 2) # ti(p(@, Z), a(v, B), ply, Z))] 
1<i<n-1 
p.qeTr, 
where the z’s come from {21,..., Zm,}, and the w’s come from {wy,...,Wmr}- 


The next lemma is just a restatement of Lemma 4.8. 
Lemma 4.10. For A € V* and a,b,a’',b! € A, we have 
O(a,b)NO(a' VY) AA 


uf 
A - 0,,(a, 6, a’, b’) 


for somem <w. 
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Definition 4.11. Let 6*, be the sentence 
VaVyVuvulom+i1(£, y, U,V) > bm(2, y, U, v)]- 
Lemma 4.12. (a) 
WV" on > Onaa 


form <w, and 
(b) for AE V*, if 


AE 6. 
and 
A F 6,(a, }, ¢, d), 
then 
A FE 6m(a, b,c, d) 
for k,m<w. 


PrRooF. To prove (a) suppose, for A € V*, 


Ako 


and, for some a,b,c,d € A, 
A. = 04450, b,c.d). 


We want to show 
A F 6m+1(a, 0, c,d). 


Choose p,q € Ty 49, fg € A, and i such that 


> > > > 


ti(p(a, f),a(c,9),p(0, f)) A tilpla, f), a(d, 9), pO, f)). 


Then one can find p’, qd € Ty, p",q"” € T., with 


a’ = p'(a, fay); 
b =p'(b, ft), 
c= dc, Gay), 
d' = q'(d, Gay). 


266 V Connections with Model Theory 


Then 


hence 
A Op (G65 70). 


As A — 0* it follows that 


Av E64 (a b5.c, a); 


so there are p,¢g € T*, and h, ke A, and j such that 


Now 
for suitable uw, and likewise 


for suitable v, so 
A — 6m+i(a, 5, , a), 


as was to be shown. 
Combining (a) with the fact that 


Ve Ok 2. Ok-+1) 


k < w, we can easily show (b). 


Definition 4.13. An algebra A is finitely subdirectly irreducible if for a,b,a’,b’! € A with 
a # b,a’ #0’ we always have 
O(a,b) N O(a’, b') AA. 


(Any subdirectly irreducible algebra is finitely subdirectly irreducible.) If V is a variety, 
then Vrs; denotes the class of finitely subdirectly irreducible algebras in V. 


Lemma 4.14. If Vg, is a strictly elementary class, then, for some no < w, 


Vist & (a By & uF v) > dno(@, Y, Us, v) 
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and 
VEO. 


PROOF. Let ® axiomatize Vfg;. Then the set of formulas 
{6 & (a#b&cH#d) & 7b6y(a, b,c, d) bnew 


cannot be satisfied by any algebra A and elements a,b,c,d € A in view of 4.10. Hence by 
the compactness theorem, there is an ng < w such that 


{® & (x x y & u x v) & Op ly 50) pray 


cannot be satisfied. By taking negations, we see that every algebra of type F* satisfies one 
of 
{® =o [(x # y & U x v) hn Om (25.9); UD) | anngs 


hence if A € V;ig, and a,b,c,d € A, we have 


AE \V (afb & c#d) > bn (a,b,¢, 4) 


m<no 
6) 
AK (atb& ced) > VV dn(a,b,c,d); 
m<no 
and as 
Viesr F Om — Om+1; 
we have 
AE (a#b&c#d) — d,,(a, b,c, d). 
Thus 


Vist = (@ Fy & uP v) > bno(@,y, u,v). 


Again if 
A € Vig: 


and a,b,c,d € A and 
A F 6no+1(4, 5, c, d) 


then 
O(a,b)NO(c,d) AA 


by 4.10, soa4#band cd. From the first part of this lemma we have 


A, = 05(G,0, 0). 
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Thus 
Vist = Oe. 


Now if 
A © Ps3(Vesr); 


say 
A< I] A; (as a subdirect product), 
i€l 
where A; € Vf, and if a,b,c,d € A and 


AF Ono +1(@, b, C, d), 


then for some p,q € Ty, 1, for some €, f from A, and for some 7, we have 


t;(p(a,@), a(c, fF), p(b, @)) & t;(p(a, @), a(d, f), p(b, @)); 
hence for some 7 € J, 
ti(p(a,@), a(c, F), p(b, €))(t) F t;(p(a, @), a(d, fF), p(b, @)) (2). 


Thus 
Ai F 9no+1(a(@), 6), (7), d(@)). 
As Vig; & 0,,, it follows that 


Ai F dno (a(t), (4), e(@), d(2)). 
We leave it to the reader to see that the above steps can be reversed to show 
A = 045(0,0, 6,0). 


Consequently, 
V" Fon: 


Definition 4.15. If V#,; is a strictly elementary class, let ®; axiomatize Vfg;. Let 2 be 
the sentence 


1<i<n—- 


VaVuVu & fla, 4, 2) os n(a0.2)] 


& VaVy c xy VV i;(x,2,y) # utara) 


1<i<n-1 
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and let ®3 be the sentence 

VavyVuvol(a # y & u #v) > On, (2, y, U, v)], 
where ng is as in 4.14. 
Lemma 4.16. If Vig, is a strictly elementary class, then 


Vi KE Ot, & G & (6; + 4,), 


where no and the ®; are as in 4.15. 


PROOF. We have 

Ve Os 
from 4.14 and 

V*E ® 


follows from 4.4. Finally, the assertions 


AE 9s, Ac YV* 


imply 
A € Veg; 


in view of 4.10; hence 
y* = ®3 ad Pj. 


The following improvement of Baker’s theorem (4.18) was pointed out by Jonsson. 


Theorem 4.17. Suppose V is a congruence-distributive variety of finite type such that Vrgr 
is a strictly elementary class. Then V has a finitely based equational theory. 


PROOF. Let p1,...,Pn—1 be the terms used in 4.4, and let V* be as defined in 4.5. Let ® 
axiomatize Vrg;. Then 


®& & fila, y, 2) a plz, y, 2) 


1<i<n— 


axiomatizes Viig,, So Vg, is also a strictly elementary class. Now let ®),®2,@3 and no be 
as in 4.15. If &* is the set of identities true of V* over some infinite set of variables, then 


a F On, & Py & (b3 — ®,) 


by 4.16. By 2.13 it follows that there is a finite subset &§ of &* such that 


by f= On. & P» & (®; = ®,). 
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We want to show that &§ axiomatizes V*, so suppose A is finitely subdirectly irreducible 
and A — &j. The only time we have made use of congruence-distributivity was to obtain 
terms for 4.4. All of the subsequent results have depended only on 4.4 (this is not surprising 
in view of Exercise 3). As ®, holds in the variety defined by Xj we can use these subsequent 
results. Hence if a,b,c,d € A anda #b,c # d, then 


A FE 6m(a, b,c, d) 


for some m < w by 4.10. As A - 67, we know 


A= Ona a. 0;e-0) 


by 4.12. Thus 

AE 93, 
and as 

A -= Pz <a ®,, 

it follows that 

AEF ®, 
This means 

A € Vegr; 


hence every subdirectly irreducible algebra satisfying Xp also satisfies *. In view of Birkhoff’s 
theorem (II§8.6), U5 is a set of axioms for V*. From 4.6 it is clear that V has a finitely based 
set of identities. 


Corollary 4.18 (Baker). If V is a finitely generated congruence-distributive variety of finite 
type, then V has a finitely based equational theory. 


Proor. The proof of Jénsson’s Theorem IV86.8 actually gives Veg; C HSP,,(4), where 
K generates V. If V is finitely generated, this means Vrs; = Vgr, a finitely axiomatizable 
elementary class. 
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EXERCISES §4 


1. Given a finite algebra A of finite type and a finite set of variables X, show that there 
is an algorithm to find a finite basis for Ida (X). 


2. Show that the identities of a variety are finitely based iff the variety is a strictly 
elementary class. 


3. (Baker). If V is a variety with ternary terms pj,...,)Pn—1 which satisfy the statements 
in Lemma 4.4, show that V is congruence-distributive. 


85. Semantic Embeddings and Undecidability 


In this section we will see that by assuming a few basic results about undecidability we will be 
able to prove that a large number of familiar theories are undecidable. The fundamental work 
on undecidability was developed by Church, Godel, Kleene, Rosser, and Turing in the 1930’s. 
Rosser proved that the theory of the natural numbers is undecidable, and Turing constructed 
a Turing machine with an undecidable halting problem. These results were subsequently 
encoded into many problems to show that the latter were also undecidable—some of the 
early contributors were Church, Novikov, Post, and Tarski. Popular new techniques of 
encoding were developed in the 1960’s by Ershov and Rabin. 

We will look at two methods, the embedding of the natural numbers used by Tarski, and 
the embedding of finite graphs used by Ershov and Rabin. 

The precise definition of decidability cannot be given here—however it suffices to think 
of a set of objects as being decidable if there is an “algorithm” to determine whether or not 
an object is in the set, and it is common to think of an algorithm as a computer program. 

Let us recall the definition of the theory of a class of structures. 


Definition 5.1. Let K be a class of structures of type £. The theory of K, written Th(i), 
is the set of all first-order sentences of type £ (over some fixed “standard” countably infinite 
set of variables) which are satisfied by K. 


Definition 5.2. Let A bea structure of type £ and let B be a structure of type £*. Suppose 
we can find formulas 


A(z) 
Fis) JOK Te Sai 1 
Sica a) forrE Ran >1 


Oe 


of type £* such that if we let 


Bo = {be B: BE A(b)} 
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then the set 
{((b1,. . ds dips b} € Bo x Bo: B = ® (di, . . ef acd) 


defines an n-ary function f on Bo, for f € F,,n > 1, and the set 


{(bt,...,bp) € B2: BE &,(b1,...,bn)} 


defines an n-ary relation r on Bo for r € R,,n > 1, such that by suitably interpreting the 
constant symbols of £ in Bo we have a structure Bo of type £ isomorphic to A. Then we 
say A can be semantically embedded in B, written A—p°B. The notation A= means 


A can be semantically embedded in some member of K, and the notation H =>>K means 
each member of H can be semantically embedded in at least one member of K, using the 


same formulas A, ®;, ®,. 


Lemma 5.3. Jf Gap sand H =z K, then Gaza kK, 1.e., the notion of semantic 
embedding is transitive. 


PROOF. (Exercise.) 


Definition 5.4. If K is a class of structures of type & and cq,...,¢, are symbols not 
appearing in £, then K(ci,...,C,) denotes the class of all structures of type LU{ci,..., cn}, 
where each c; is a constant symbol, obtained by taking the members B of K and arbitrarily 
designating elements c),...,¢, in B. 


Definition 5.5. Let N be the set of natural numbers, and let N be (N,+,-, 1). 


We will state the following result without proof, and use it to prove that the theory of 
rings and the theory of groups are undecidable. (See [33].) 


Theorem 5.6 (Tarski). Given K, if for some n < w we have N=>+ K(c,..-,Cn), then 
Th(K) is undecidable. 


Lemma 5.7 (Tarski). N Z = (Z,+,-,1), Z being the set of integers. 


sem 


Proor. Let A(x) be 


yee Ayale ey yn te + ya- ya + 1. 


By a well-known theorem of Lagrange, 
ZEA(n) iffne N. 


Let 6, (21,22, y) be 
Ly+rrwy SY, 
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and let ®.(x1, 22, y) be 
T° XQ Y. 


Then it is easy to see N>~Z. 


Theorem 5.8 (Tarski). The theory of rings is undecidable. 


PROOF. Z is a ring, so 5.6 applies. 


Remark. In the above theory of rings we can assume the language being used is any of the 
usual languages such as {+,-}, {+,-, 1}, {+,-,—,0, 1} in view of 5.6. 


Lemma 5.9 (Tarski). 
(Z, ee 1) sem (Z, a e 1), 
where ? denotes the function mapping a to a?. 
Proor. Let A(x) be 
eee 


let ®, (x1, 22, y) be 
Ly+rrwy Sy; 


and let ®.(x1, 22, y) be 
ytyt+aj tx & (11 +22). 


To see that the latter formula actually defines - in Z, note that in Z 


a-b=c@2c+a°+? = (a+b). 


Lemma 5.10 (Tarski). 
(Z, +57, 1) sem (Z, +5 1,1), 


where alb means a divides b. 
ProoF. Let A(z) be 
LST: 
let ®+ (x1, 22, y) be 
T+xXRy 
and let ®2(x,, y) be 
Val(ti + y)l2  ((rilz) & (#1 + 1IJz))] 
& Vuvov2[(((ut+a,~y) & (v+1224)) 


— (ulz @ (a|z & v|z))). 
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Then (a,b) holds for a,b € Z iff 


a+b=+a(a+1) 
b—a=+a(a-1), 


and thus iff 
b=a’. 


Lemma 5.11 (Tarski). Let Sym(Z) be the set of bijections from Z to Z, let o denote 
composition of bijections, and let x be the bijection defined by m(a) =a+1,a€ Z. Then 


(Z,+,|,1) <= (Sym(Z), 0, 7). 


sem 


ProoF. Let A(x) be 
LOMA OH; 


let @, (21, £2, y) be 
T1OM SY, 
and let ®)(x1, 22) be 


V2(a1 Oz 20%, > 1202 204d). 


For 0 € Sym(Z) note that 
oon =T0EG 


iff for a € Z, 
o(a+1)=o(a) +1; 
hence if 
Conant oS 
then 
o(a) = 0(0) +4, 
1.€., 
o = 7° 
Thus 
(Sym(Z), 0,7) F A(@) 
iff 


ao Ee {nm ne Z}. 
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Clearly @, defines a function on this set, and indeed 


(Sym(Z), °, T) is b, (1°, an, °°) 


iff 
a+b=e. 


Next we wish to show 
(Sym(Z), 0,7) & Bi(n*,n°) iff ald, 


in which case the mapping 
ar 1 


for a € Z gives the desired isomorphism to show 


(Z, +,|,1) —> (Sym(Z), 0,7). 


sem 


So suppose alb in Z. If o € Sym(Z) and 
gom°=T%00 


we have 
o(c+a)=o(c)+a 


for c € Z; hence 
o(c+d-a)=o(c)+d-a 


for c,d € Z, so in particular 


hence 


Thus 
alb => (Sym(Z), 0,7) & ®(2*, 7°). 


Conversely suppose 
(Sym(Z), 0,7) F (1°, 7”) 


for some a,b € Z. If b = 0 then alb, so suppose b ¥ 0. Let 


ct+ta ifalc 
plc) = | 


é ifafc 


for c € Z. Clearly p € Sym(Z). An easy calculation shows 


pon’ = 7" 0p; 
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hence looking at @; we must have 
ponm=n op. 


Now 
‘ _ jetat+d ifale 
monte) = 14) ifafe 
and 
i c+b+a_ ifalc+b 
pom(c) = 
c+b ifatct+o. 
Thus 


alc iff alc+8, 


for c € Z; hence alb. 


Corollary 5.12 (Tarski). The theory of groups is undecidable. 


ProoF. From 5.3, 5.7, 5.9, 5.10, and 5.11 we have N => (Sym(Z), 0,7). If K is the class 


of groups (in the language {-}) then (Sym(Z), 0,7) € K(c,); hence by 5.6 it follows that 
Th(4) is undecidable. 


A major result of J. Robinson was to show (N,+,-, 1) zp (Q,+,-, 1); hence the theory 
of fields is undecidable. 

Now we turn to our second technique for proving undecidability. Recall that a graph is 
a structure (Gr) where r is an irreflexive and symmetric binary relation. 


Definition 5.13. G'g, will denote the class of finite graphs. 
The following result we state without proof. (See [13]; Rabin [1965].) 
Theorem 5.14 (Ershov, Rabin). Jf we are given K, and for some n < w we have 
Greig ee (CS CG 
then Th(x) is undecidable. 
Corollary 5.15 (Grzegorczyk). The theory of distributive lattices is undecidable. 


Proor. If P = (P,<) is a poset, recall that a lower segment of P means a subset S of P 
such that ae P, b€ S anda < 6b imply a € S. In [83 Exercise 4 it was stated that a finite 
distributive lattice is isomorphic to the lattice of nonempty lower segments (under C) of the 
poset of join irreducible elements of the lattice; and if we are given any poset with 0 then 
the nonempty lower segments form a distributive lattice, with the poset corresponding to 
the join irreducibles. 
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Thus given a finite graph (G,r), let us define a poset P = (P, <) by 
P=GU {{a,b} C G: arb holds} U {0}, 
and require p < q to hold iff p = q,p = 0, or p € G and q is of the form {p,b}. Then in 
the lattice L of lower segments of P the minimal join irreducible elements are precisely the 
lower segments of the form {a,0} for a € G; and arb holds in G iff there is a join irreducible 
element above {a,0} and {b,0} in L. (See Figure 34 for the poset corresponding to the graph 
in Figure 30.) Hence if we let Irr(x) be 
VyVz(yV zea (yr uVvze2)) 
and then let A(x) be 
Irr(x) & Vyly Sv & Irr(y)) > (yR OVy a) & (« #0) 


and let ®,(21, 22) be 


(a1 € Xq) & Ay[Irr(y) & a <y & a2 < yl, 


where in the above formulas u < v is to be replaced by u A v = u, then we see that (G,r) is 
semantically embedded in (LZ, V, A, 0). 


{a,b} {b,c} {b,d} {c,d} 


0 
Figure 34 


Corollary 5.16 (Rogers). The theory of two equivalence relations is undecidable, i.e., if K 
is the class of structures (A,71,1T2) where r; and rz are both equivalence relations on A, then 
Th(K) is undecidable. 
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PROOF. Given a finite graph (G,r) let < be a linear order on G. Then let S be the set 
GU {(a, b) : arb}. 
Let the equivalence class a/r ; be 
{a} U {(a, b) : arb,a < b} U {(b, a) : arb,a < b} 
and let the equivalence class b/r2 be 
{b} U {(a, b) : arb,a < b} U {(b, a) : arb, a < bd}. 


(See Figure 35 for the structure (S,r1,r2) corresponding to the graph in Figure 30 with 
a<b<c<d. The rows give the equivalence classes of r;, the columns the equivalence 
classes of rz.) Then (S,1r1,72) is a set with two equivalence relations. Let 


T9 =T1NTo. 
Then the elements of G are precisely those s € S' such that 
s/ro = {s}, 
and for a,b € G,arb holds iff 
{ce € S: aric and crab} U {ce € S: bric and crea}| = 2. 
Thus the formulas 


A(x) = Vyl(2riy & xray) 92 y| 


®,.(21,%2) = dy nin # Yy2 & (fen & wrata) Vv 


( & ror & wrat) | } 


(G,r) = HS, 11,12). 


sem 


suffice to show 


© <a, b> 
oe 0 <b, a> 


o<b,c) © <b,d) 
ob o<c,b> o <d, by 


©<c,d> 
oc o<dc> 


Od 


Figure 35 
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A more general notion of a semantic embedding of a structure A into a structure B is 
required for some of the more subtle undecidability results, namely the interpretation of the 
elements of A as equivalence classes of n-tuples of elements of B. Of course this must all be 
done in a first-order fashion. For notational convenience we will define this only for the case 
of A a binary structure, but it should be obvious how to formulate it for other structures. 


Definition 5.17. Let A = (A,r) be a binary structure, and B a structure of type £. A can 


be semantically embedded in B, written A —> B, if there are £-formulas, for some n < w, 


ZEN Gi eee ce) 
®,.(@1, sey Un; Y1,--- Un) 
Eq(#1, seey Ens Y1,--- Yn) 


such that if we let 
D=1{{by,2.%,0;,) Bs BEA bis 50) t 


and if r? is the binary relation 


and = is the binary relation 


= = {(b,¢) €Dx D: BE Eq(b,2)} 


then = is an equivalence relation on D and we have 
AnD) = 


where 


r?/ = = {(b/ =z =) € D/=xD/=:r?n(b/ = xz =) ¥ }. 


A class H of binary structures can be semantically embedded into a class K of structures 
of type £, written H — > K, if there are formulas A, ®,,Eq as above such that for each 
structure A in the class H there is a member B of K such that A, ®,, Eq provide a semantic 
embedding of A into B. 


Using our more general notion of semantic embedding we still have the general results 
from before, two of which we repeat here for convenience. 


Theorem 5.18. (a) The semantic embeddability relation —> is transitive. 
(b) (Ershov, Rabin). If finite graphs can be semantically embedded into a class K(c1,..-,€n), 
then the first-order theory of K is undecidable. 


For the last part of this section we will look at results on Boolean pairs. 
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Definition 5.19. A Boolean pair is a structure (B, Bo, <) where (B, <) is a Boolean algebra 
(i.e., this is a complemented distributive lattice) and Bp is a unary relation which gives a 
subalgebra (Bo, <). The class of all Boolean pairs is called BP. 

The Boolean pairs (B, Bo, <) such that (B,<) is atomic (i.e., every element is a sup of 
atoms) and Bo contains all the atoms of (B,<) form the class BP!. 

The Boolean pairs (B, Bo, <) such that for every element b € B there is a least element 
b € Bo with b < 6 constitute the class BP™. 

The Boolean pairs (B, By, <) in BP™ such that (B,<), (Bo, <) are atomic form the class 
BP. 


Definition 5.20. Let Gj, be the class of finite graphs (G,r) such that r 4 ©. 


sem 


Lemma 5.21. Gg, —> Gi, (c). 


PROOF. (Exercise.) 


Adapting a technique of Rubin, McKenzie proved the following. 
Theorem 5.22 (McKenzie). The theory of BP? is undecidable. 


PROOF. Given a member G = (G,r) of Gj, let X = G x w. Two sets Y and Z are said 
to be “almost equal,” written Y = Z, if Y and Z differ by only finitely many points. For 
g €G, let C, = {(9,9) : 7 € w} C X, a “cylinder” of X. Let B be all subsets of X which 
are almost equal to a union of cylinders, i.e., all Y such that for some S CG,Y = Uses Cy 
Note that (B,C) is a Boolean algebra containing all finite subsets of X. 
To define Bo first let 
E = {{a,b}: (a,b) er}, 


the set of unordered edges of G, and then for each g € G choose a surjective map 
Gig Cs SEX w 
such that 
la, (e)I= e Pe. 
Then, for (e,7) € FE x w, let 
Deg = LJ a7 (e,3)). 


gEG 
This partitions X into finite sets D.,; such that for g € G, 


2 ifgee 


Dei, AC,| = 
Des al : if g Ze. 
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Let Bo be the set of finite and cofinite unions of D.,;’s. Note that (Bo, C) is a subalgebra of 
(B,C) as a Boolean algebra. 
Now we want to show (G,r) > (B, By, C): 


A(x) is “for all atoms y of Bo there are exactly two or three atoms of B below x A y” 

Eq(z, y) is Vzdu [u is an atom of Bo and there are exactly two or three atoms of B below 
rAyAuand (r©AyAzAuxrO0ortzAyAz’Au®0))| 

®,(z,y)isx #y & VuVo[Eq(u, x) & Eq(v, y) — (for some atom z of Bo there are exactly 
two atoms of B below each of uA z and uA z)]. 


sem 


To see that G —> (B, Bo, <) it suffices to check the following claims: 


(a) (B, Bo, <) E A(Z) implies 7 = C, for some g € G (just recall the description of the 
elements of B), 

(b) (B, Bo, <) KA(Gy) for g €G, 

(c) for X,Y such that A(X), A(Y) hold we have Eq(X,Y) iff X + Y, 

(d) for X,Y such that A(X), A(Y) hold we have ®,(X,Y) iff X = C,,Y = Cy for some 
g,g € G with (g,g') €7, 

(e) the mapping g +> C,/ = establishes G & (D,r?)/ =. 


Thus we have proved 
Gin = BP’. 


hence 
Gin(c) —> BP?(c); 


thus by Lemma 5.21 
Gan —> BP*(c). 


Theorem 5.23 (Rubin). The theory of CA, the variety of monadic algebras, is undecidable. 


sem 


ProoF. It suffices to show BP? =“ CA; as we have Ggn —> BP?(c,). Given (B, Bo, <)€ 
BP?, let c be the unary function defined on the Boolean algebra (B, <) by 


c(b) = the least member of Bo above b. 
Then (B,V,A,’,c,0,1) is a monadic algebra, and with 


A(x) defined as 7 ¥ x 


®p,(x) defined as x © c(x) 


we have, using the old definition of semantic embedding, (B, Bo, <) ar (B,V,/,',c,0,1). 


sem 
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Actually the class BP™ defined above is just an alternate description of monadic algebras, 
and BP? C BP™., 

Finally we turn to the class BP', a class which has played a remarkable role in the 
classification of decidable locally finite congruence modular varieties. 


Theorem 5.24 (McKenzie). The theory of BP! is undecidable. 
PROOF. Given a finite graph (G,r) with r £ @ first construct (B, Bo, C) as in 5.22, so B is 


a field of subsets of X = G x w. Let 7 be the first projection map from X x w to X, and 
define 


B* = {x7 (Y):Y € B} 
Beata (VY) :Y € Bo}. 
Then 
(B", Bo, C) = (B, Bo, S), 
and each nonzero member of B* contains infinitely many points from X x w. Now let 
BY ={Y CX xw:Y =Z for some Z € B*} 
By ={Y CX xw:Y =Z for some Z € Bet. 
Then (B**, Bj*,C) € BP! as all finite subsets of X x w belong to both B** and Be*, and 
furthermore 
(B™, By", C)/ = & (B*, Bo, S). 
Now “Y is finite” can be expressed for Y € B** by 
Vala <Y > 2 € Bo" 
as every nonzero element bp of Bo has an element b € B — Bo below it. Thus 


sem 


(B*, Bo, C) ae (BOT By Ss 


hence 
(B, Bo, C) as LB. Ba’, S 
This shows 
BP? =, BP’. 
hence 
Gin —> BP'(c,). 
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Recent Developments and 
Open Problems 


At several points in the text we have come very close to some of the most exciting areas 
of current research. Now that the reader has had a substantial introduction to universal 
algebra, we will survey the current situation in these areas and list a few of the problems 
being considered. (This is not a comprehensive survey of recent developments in universal 
algebra—the reader will have a good idea of the breadth of the subject if he reads Taylor’s 
survey article [35], Jonsson’s report [20], and the appendices to Gratzer’s book [16].) 


§1. The Commutator and the Center 


One of the most promising developments has been the creation of the commutator by Smith 
[1976]. He showed that, for any algebra A in a congruence-permutable variety, there is a 
unique function, [—,—], called the commutator, from (Con A) x (Con A) to Con A with 
certain properties. In the case of groups this is just the familiar commutator (when one 
considers the corresponding normal subgroups). Rather abruptly, several concepts one had 
previously considered to belong exclusively to the study of groups have become available on 
a grand scale: viz., solvability, nilpotence, and the center. Hagemann and Herrmann [1979] 
subsequently extended the commutator to any algebra in a congruence-modular variety. 
Freese and McKenzie [1987] have given another definition of the commutator, and of course 
we used their (first-order) definition of the center (of an arbitrary algebra) in I11§13. These 
new concepts have already played key roles in Burris and McKenzie [1981] and Freese and 
McKenzie [1981], [1987]. 


Problem 1. For which varieties can we define a commutator? 


Problem 2. Find a description of all A (parallel to I1§13.4) such that 7(A) = Va. 
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§2. The Classification of Varieties 


Birkhoff’s suggestion in the 1930’s that congruence lattices should be considered as funda- 
mental associated structures has proved to be remarkably farsighted. An important early 
result was the connection between modular congruence lattices and the unique factorization 
property due to Ore [1936]. A major turning point in showing the usefulness of classifying a 
variety by the behavior of the congruence lattices was Jénsson’s theorem [1967] that if V(A’) 
is congruence-distributive, then V() = IPsHSPy(K). 

The role of a single congruence, the center, is rapidly gaining attention. Let us call a 
variety modular Abelian if it is congruence-modular and, for any algebra A in the variety, 
Z(A) = Va. Such varieties are essentially varieties of unitary left R-modules. A variety V is 
said to be (discriminator) © (modular Abelian) if it is congruence modular and there are two 
subvarieties V,, V2 such that V; is a discriminator variety, V2 is a modular Abelian variety, 
and V = V, V V9. For such a variety V (see Burris and McKenzie [1981]) each algebra in V 
is, up to isomorphism, uniquely decomposable as a product of an algebra from V; and an 
algebra from V3. The importance of this class of varieties is discussed in 83 and §5 below. 
The following Hasse diagram (Figure 36) shows some of the most useful classes of varieties 
in research. 


all varieties 


congruence-modular 


congruence 
-distributive congruence- 
permutable 
arithmetical ee 
(discriminator) ® 
semisimple (modular Abelian) 
arithmetical 
discriminator 
modular 
generated by Abelian 


a primal algebra 


trivial varieties 
Figure 36 
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83. Decidability Questions 


Decidability problems have been a popular area of investigation in universal algebra, thanks 
to the fascinating work of Mal’cev [24] and Tarski [33]. Let us look at several different types 
of decidability questions being studied. 

(a) First-order Theories. In V85 we discussed the semantic embedding technique for 
proving that theories are undecidable. There has been a long-standing conviction among 
researchers in this area that positive decidability and nice structure theory go hand in hand. 
The combined efforts of Szmielew [1954], Ershov [1972] and Zamjatin [1978a] show that 
a variety of groups is decidable iff it is Abelian. This has recently been strengthened by 
McKenzie [1982c] as follows: any class of groups containing Ps(G), where G can be any 
nonabelian group, has an undecidable theory. In Burris and Werner [1979] techniques of 
Comer [1975] for cylindric algebras have been extended to prove that every finitely generated 
discriminator variety of finite type has a decidable theory. Zamjatin [1976] showed that a 
variety of rings has a decidable theory iff it is generated by a zero-ring and finitely many finite 
fields. Recently Burris and McKenzie [1981] have applied the center and commutator to prove 
the following: if a locally finite congruence-modular variety has a decidable theory, then it 
must be of the form (discriminator) ® (modular Abelian). Indeed there is an algorithm such 
that, given a finite set AK of finite algebras of finite type, one can decide if V(K) is of this 
form, and if so, one can construct a finite ring R with 1 such that V(‘) has a decidable 
theory iff the variety of unitary left R-modules has a decidable theory. This leads to an 
obvious question. 


Problem 3. Which locally finite varieties of finite type have a decidable theory? 
Zamjatin [1976] has examined the following question for varieties of rings. 


Problem 4. For which varieties of finite type is the theory of the finite algebras in the variety 
decidable? 


Actually we know very little about this question, so let us pose two rather special prob- 
lems. 


Problem 5. Do the finite algebras in any finitely generated arithmetical variety of finite type 
have a decidable theory? 


Problem 6. Do the finite algebras in any finitely generated congruence-distributive, but not 
congruence-permutable, variety of finite type have an undecidable theory? 


(b) Equational Theories. Tarski [1953] proved that there is no algorithm for deciding if an 
equation holds in all relation algebras (hence the first-order theory is certainly undecidable). 
Mal’cev [24] showed the same for unary algebras. Murskii [1968] gave an example of a 
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finitely based variety of semigroups with an undecidable equational theory. R. Freese [1979] 
has proved that there is no algorithm to decide which equations in at most 5 variables hold 
in the variety of modular lattices. From Dedekind’s description (see [3]) of the 28 element 
modular lattice freely generated by 3 elements it is clear that one can decide which equations 
in at most 3 variables hold in the variety of modular lattices. 


Problem 7. Is there an algorithm to decide which equations in at most 4 variables hold in 
modular lattices? 


(c) Word Problems. Given a variety V of type F, a presentation (of an algebra A) in V 
is an ordered pair (G, R) of generators G and defining relations R such that the following 
hold. 


(i) Ris a set of equations p(gi,.--,9n) © 9(g1,---;9Gn) of type FUG (we assume FANG = B) 
with g1,...,9n € G. 

(ii) If V is the variety of type JUG defined by NU R, where © is a set of equations defining 
V, then A is the reduct (see II§1 Exercise 1) of Fy(@) to the type of V. 


When the above holds we write Py(G, R) for A, and say “Py(G, R) is the algebra in V freely 
generated by G subject to the relations R.” If R = @ we just obtain Fy(G). A presentation 
(G, R) is finite if both G and R are finite, and in such case Py(G, R) is said to be finitely 
presented. 

The word problem for a given presentation (G, R) in V asks if there is an algorithm to 
determine, for any pair of “words,” i.e., terms r(g1,.--,9n),§(91,---,9n), Whether or not 


Fy(@): 27 (Gis-224 Gn) © S(Gu 23 Gn): 


If so, the word problem for (G, R) is decidable (or solvable); otherwise it is undecidable (or 
unsolvable). The question encountered in (b) above of “which equations in the set of variables 
X hold in a variety V” is often called the word problem for the free algebra Fy (X). The word 
problem for a given variety V asks if every finite presentation (G, R) in V has a decidable 
word problem. If so, the word problem for V is decidable; otherwise it is undecidable. 

Markov [1947] and Post [1947] proved that the word problem for semigroups is undecid- 
able. (A fascinating introduction to decidability and word problems is given in Trakhtenbrot 
[36].) Perhaps the most celebrated result is the undecidability of the word problem for 
groups (Novikov [1955]). A beautiful algebraic characterization of finitely presented groups 
Py(G, R) with solvable word problems is due to Boone and Higman [1974], namely Py(G, R) 
has a solvable word problem iff it can be embedded in a simple group S which in turn can be 
embedded in a finitely presented group T. This idea has been generalized by Evans [1978] 
to the variety of all algebras of an arbitrary type. Other varieties where word problems have 
been investigated include loops (Evans [1951]) and modular lattices (Hutchinson [1973], Lip- 
schitz [1974] and Freese [1979]). The survey article of Evans [14] is recommended. 


Problem 8. Is the word problem for orthomodular lattices decidable? 
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(d) Base Undecidability. This topic has been extensively developed by McNulty [1976] 
and Murskii [1971]. The following example suffices to explain the subject. Suppose one takes 
a finite set of equations which are true of Boolean algebras and asks: “Do these equations 
axiomatize Boolean algebras?” Surprisingly, there is no algorithm to decide this question. 


Problem 9. Can one derive the Linial-Post theorem [1949] (that there is no algorithm to de- 
termine if a finite set of tautologies with modus ponens axiomatizes the propo- 
sitional calculus) from the above result on Boolean algebras, or vice versa? 


(e) Other Undecidable Properties. Markov [26] showed that a number of properties of 
finitely presented semigroups are undecidable, for example there is no algorithm to determine 
if the semigroup is trivial, commutative, etc. Parallel results for groups were obtained 
by Rabin [1958]; and McNulty [1976] investigates such questions for arbitrary types. In 
[1975] McKenzie shows that the question of whether or not a single groupoid equation has 
a nontrivial finite model is undecidable, and then he derives the delightful result that there 
is a certain groupoid equation which will have a nontrivial finite model iff Fermat’s Last 
Theorem is actually false. For decidability questions concerning whether a quasivariety is 
actually a variety see Burris [1982b] and McNulty [1977]. A difficult question is the following. 


Problem 10. (Tarski). Is there an algorithm to determine if V(A) has a finitely based equa- 
tional theory, given that A is a finite algebra of finite type? 


84. Boolean Constructions 


Comer’s work [1971], [1974], [1975], and [1976] connected with sheaves has inspired a se- 
rious development of this construction in universal algebra. Comer was mainly interested 
in sheaves over Boolean spaces, and one might say that this construction, which we have 
formulated as a Boolean product, bears the same relation to the direct product that the 
variety of Boolean algebras bears to the class of power set algebras Su(/). Let us discuss the 
role of Boolean constructions in two major results. 

The decidability of any finitely generated discriminator variety of finite type (Burris and 
Werner [1979]) is proved by semantically embedding the countable members of the variety 
into countable Boolean algebras with a fixed finite number of distinguished filters, and then 
applying a result of Rabin [1969]. The semantic embedding is achieved by first taking 
the Boolean product representation of Keimel and Werner [1974], and then converting this 
representation into a better behaved Boolean product called a filtered Boolean power (the 
filtered Boolean power is the construction introduced by Arens and Kaplansky in [1948}). 

The newest additions to the family of Boolean constructions are the modified Boolean 
powers, introduced by Burris in the fall of 1978. Whereas Boolean products of finitely many 
finite structures give a well-behaved class of algebras, the modified Boolean powers give 
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a uniform method for constructing deviant algebras from a wide range of algebras. This 
construction is a highly specialized subdirect power, but not a Boolean product. 

The construction is quite easy. Given a field B of subsets of a set J, a subfield Bo of B, 
an algebra A, and a congruence @ on A, let 


A[B, Bo, 6]* = {f € A’: f7'(a) € B, f-*(a/@) € Bo, for a € A, and |Rg(f)| < wh. 


This is a subuniverse of A/, and the corresponding subalgebra is what we call the modified 
Boolean power A[B, Bo, 6]*. McKenzie developed a subtle generalization of this construction 
in the fall of 1979 for the decidability result of Burris and McKenzie mentioned in §3(a) 
above. His variation proceeds as follows: let B,Bo, A and @ be as above, and suppose 
A <S. Furthermore assume that Bo contains all singletons {i}, for 7 € J. Then the set 


A[B, Bo, 0, S]* = {f € 87 : dg € A[B, Bo, 6]* with [f 4 g] finite} 


is a subuniverse of S’. The corresponding subalgebra A[B, Bo, 0, S|* is the algebra we want. 


85. Structure Theory 


We have seen two beautiful results on the subject of structure theory, namely the Bulman- 
Fleming, Keimel and Werner theorem (IV89.4) that every discriminator variety can be rep- 
resented by Boolean products of simple algebras, and McKenzie’s proof [1982b] that every 
directly representable variety is congruence-permutable. McKenzie goes on to show that in 
a directly representable variety every directly indecomposable algebra is modular Abelian or 
functionally complete. 

The definition of a Boolean product was introduced in Burris and Werner [1979] as a 
simplification of a construction sometimes called a Boolean sheaf. Subsequently Krauss and 
Clark [1979] showed that the general sheaf construction could be described in purely algebraic 
terms, reviewed much of the literature on the subject, and posed a number of interesting 
problems. Recently Burris and McKenzie [1981] have proved that if a variety V can be 
written in the form JIT“(k’), with K consisting of finitely many finite algebras, then V is of 
the form (discriminator) ® (modular Abelian); and then they discuss in detail the possibility 
of Boolean powers, or filtered Boolean powers, of finitely many finite algebras representing 
a variety. The paper concludes with an internal characterization of all quasiprimal algebras 
A such that the [countable] members of V(A) can be represented as filtered Boolean powers 
of A, generalizing the work of Arens and Kaplansky [1948] on finite fields. 

Let us try to further crystallize the mathematically imprecise question of “which varieties 
admit a nice structure theory” by posing some specific questions. 


Problem 11. For which varieties does there exist a bound on the size of the directly inde- 
composable members? 
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Problem 12. For which varieties V is every algebra in V a Boolean product of directly 
indecomposable algebras? (Krauss and Clark [1979]) of subdirectly irreducible 
algebras? of simple algebras? 


Problem 13. For which finite rings R with 1 is the variety of unitary left R-modules directly 
representable? 


86. Applications to Computer Science 


Following Kleene’s beautiful characterization [1956] of languages accepted by finite state 
acceptors and Myhill’s study [1957] of the monoid of a language, considerable work has 
been devoted to relating various subclasses of regular languages and the associated class of 
monoids. For example Schiitzenberger [1965] showed that the class of star-free languages 
corresponds to the class of groupfree monoids. For this direction see the books [11], [12] of 
Eilenberg, and the problem set and survey of Brzozowski [7], [7a]. 


87. Applications to Model Theory 


Comer [1974] formulated a version of the Feferman- Vaught theorem (on first-order properties 
of direct products) for certain Boolean products, and in Burris and Werner [1979] it is 
shown that all of the known variations on the Feferman-Vaught theorem can be derived 
from Comer’s version. 

Macintyre [1973/74] used sheaf constructions to describe the model companions of certain 
classes of rings, and this was generalized somewhat by Comer [1976] and applied to varieties of 
monadic algebras. In Burris and Werner [1979] a detailed study is made of model companions 
of discriminator varieties, and then the concept of a discriminator formula is introduced to 
show that the theorems of Macintyre and Comer are easy consequences of the results on 
discriminator varieties. A formula T(z, y,u,v) is a discriminator formula for a class K of 
algebras if it is an existential formula in prenex form such that the matrix is a conjunction 
of atomic formulas, and we have 


K Er(a,y,u,v) > [ery &urvV(egy & ax). 


Problem 14. For which varieties V can one find a discriminator formula for the subclass of 
subdirectly irreducible members? 


Problem 15. Which finite algebras have a discriminator formula? 
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88. Finite Basis Theorems 


In V84 we looked at the three known general results on the existence of a finite basis for a va- 
riety (i.e., the variety is finitely axiomatizable). For many years universal algebraists hoped 
to amalgamate the Oates-Powell theorem [1965] (that a variety generated by a finite group 
has a finite basis) with Baker’s theorem (that a finitely generated congruence-distributive 
variety of finite type has a finite basis) into one theorem saying that a finitely generated 
congruence-modular variety of finite type would have a finite basis. This was shown im- 
possible by Polin [1976] who gave an example of a finitely generated but not finitely based 
(congruence-permutable) variety of nonassociative rings. 


Problem 16. Find a common generalization of the Oates-Powell theorem and Baker’s theo- 
rem. 


89. Subdirectly Irreducible Algebras 


Let F be a type of algebras, and let « = |F|+-w. As we have seen in V§3, Taylor [1972] proved 
that if a variety V of type F has a subdirectly irreducible algebra of size greater than 2" then 
V has arbitrarily large subdirectly irreducible members. Later McKenzie and Shelah [1974] 
proved a parallel result for simple algebras. In V§3 we proved Quackenbush’s result [1971] 
that if A is finite and V(A) has only finitely many finite subdirectly irreducible members 
(up to isomorphism), then V(A) contains no infinite subdirectly irreducible members. Using 
the commutator Freese and McKenzie [1981] proved that a finitely generated congruence- 
modular variety with no infinite subdirectly irreducible members has only finitely many finite 
subdirectly irreducible members. 


Problem 17. (Quackenbush). If a finitely generated variety has no infinite subdirectly irre- 
ducible members, must it have only finitely many finite subdirectly irreducible 
algebras? 
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